Scraping web data in R

August 10, 2011
By

(This article was first published on Modern Tool Making, and kindly contributed to R-bloggers)

In my last post, I went through a lot of effort to scrape the PMI index off the ISM website.  It turns out that was unnecessary effort, as commentator "senne" pointed out that this index is available from FRED, with the symbol NAPM.  I've updated my code, which now pulls all the data straight from FRED.

However, it was surprisingly easy to scrape web data into R, using the readHTMLTable function in the XML package.  I thought I'd keep the code I used on my blog, as it's a good example of how easily you can pull web data into R.




To leave a comment for the author, please follow the link and comment on his blog: Modern Tool Making.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.