Scraping web data in R
In my last post, I went through a lot of effort to scrape the PMI index off the ISM website. It turns out that was unnecessary effort, as commentator “senne” pointed out that this index is available from FRED, with the symbol NAPM. I’ve updated my code, which now pulls all the data straight from FRED.
However, it was surprisingly easy to scrape web data into R, using the readHTMLTable function in the XML package. I thought I’d keep the code I used on my blog, as it’s a good example of how easily you can pull web data into R.
To leave a comment
for the author, please follow the link and comment on their blog: Modern Tool Making
offers daily e-mail updates
news and tutorials
on topics such as: Data science
, Big Data, R jobs
, visualization (ggplot2
), programming (RStudio
, Web Scraping
) statistics (regression
, time series
) and more...
If you got this far, why not subscribe for updates
from the site? Choose your flavor: e-mail
, or facebook