Digital Data Collection course

March 20, 2015

(This article was first published on Quantifying Memory, and kindly contributed to R-bloggers)

Another year, another web scraping course. Taught through SSRMC at the University of Cambridge. Below are slides from all three sessions.

In the course I tried to achieve the following:
– Show how to connect R to resources online
– Use loops and functions to iteratively access online content
– How to work with APIs
– How to harvest data manually using Xpath expressions.

What’s new?
– Many more examples and practice tasks
– Updated API usage
– Some bug fixes (and probably many new bugs introduced)

Slides from last year’s course:
session one: the basics
session two: digging deeper
session three: scaling up
session four: APIs

To leave a comment for the author, please follow the link and comment on their blog: Quantifying Memory. offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)