An excellent introduction to MapReduce and Hadoop

July 19, 2013

(This article was first published on, and kindly contributed to R-bloggers)

by Yanchang Zhao,

The lectures in week 3 of a free online course Introduction to Data Science give an excellent introduction to MapReduce and Hadoop, and demonstrate with examples how to use MapReduce to do various tasks, such as, word frequency counting, matrix multiplication, simple social network analysis, and a join operation like in a relational database. There are also interesting comparisons with relational DB. The examples look simple, but they are scalable and can handle really Big Data. The course also introduces NoSQL systems.

Although the course has been closed, all lecture videos can be accessed via the “Preview” button on the course page at the above link.

They are definitely worth watching if you want to get some idea about MapReduce and Hadoop.

To leave a comment for the author, please follow the link and comment on their blog: offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.