New R package raincpc: Obtain and Analyze Rainfall data from the Climate Prediction Center (CPC)

November 26, 2013

(This article was first published on Nine Lives, and kindly contributed to R-bloggers)

The Climate Prediction Center’s (CPC) daily rainfall data for the entire world, 1979 – present & 50-km resolution, is one of the few high quality and long term observation-based rainfall products. Data is available at CPC’s ftp site. However, it is a lot of data and there is no software to analyze and visualize the data.

Some issues with size/format of the CPC data:

  • too many files (365/366 files per year * 34 years, separate folder for each year)
  • each file has 360 rows and 720 columns
  • file naming conventions have changed over time – one format prior to 2006 and couple of different formats afterwards
  • file formats have changed over time – gzipped files prior to 2008 and plain binary files afterwards
  • downloading multiple files simultaneously from the CPC ftp site, using wget, does not seem to work properly
  • there is no software/code readily available to easily process/visualize the data

The R package `raincpc` makes life easier by providing functionality to download and process the data from CPC’s ftp site. Some features of this new package are:

  • Data for anytime period during 1979-present can be downloaded and processed
  • Just two functions required: one to download the data (`cpc_get_rawdata`) and another to process the downloaded data (`cpc_read_rawdata`)
  • Making spatial maps using the processed data is easy, via ggplot

Here are some examples on how to obtain and visualize the data –

Below are the relevant CRAN and GitHub sites:

Please let me know if you find any errors or if you have any comments or suggestions.

To leave a comment for the author, please follow the link and comment on their blog: Nine Lives. offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)