An excellent introduction to MapReduce and Hadoop

July 19, 2013

(This article was first published on blog.RDataMining.com, and kindly contributed to R-bloggers)

by Yanchang Zhao, RDataMining.com

The week 3 lectures of the free online course Introduction to Data Science give an excellent introduction to MapReduce and Hadoop. They demonstrate, with examples, how to use MapReduce for tasks such as word frequency counting, matrix multiplication, simple social network analysis, and a join operation like the one in a relational database. There are also interesting comparisons with relational databases. The examples look simple, but they scale and can handle really Big Data. The course also introduces NoSQL systems.
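To make the map, shuffle, and reduce phases concrete, here is a minimal single-process sketch in Python of two of the tasks mentioned above: word frequency counting and a relational-style join. It is not the course's code and it does not use the Hadoop API. The helper `map_reduce`, the mapper and reducer names, and the sample data are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_reduce(records, mapper, reducer):
    """Toy in-memory MapReduce (hypothetical helper, not a Hadoop API)."""
    # Map phase: each record yields zero or more (key, value) pairs.
    pairs = chain.from_iterable(mapper(record) for record in records)

    # Shuffle phase: group all emitted values by key.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)

    # Reduce phase: each key and its list of values yield zero or more outputs.
    results = []
    for key in sorted(groups):
        results.extend(reducer(key, groups[key]))
    return results


# --- Word frequency counting ---
def wc_mapper(line):
    # Emit (word, 1) for every word in the line.
    for word in line.lower().split():
        yield word, 1


def wc_reducer(word, counts):
    # Sum the ones emitted for this word.
    yield word, sum(counts)


# --- Join on a shared key, as in a relational database ---
def join_mapper(record):
    # Tag each value with its table of origin so the reducer can tell them apart.
    table, key, value = record
    yield key, (table, value)


def join_reducer(key, tagged_values):
    # Pair every "users" value with every "orders" value that shares the key.
    users = [value for table, value in tagged_values if table == "users"]
    orders = [value for table, value in tagged_values if table == "orders"]
    for user in users:
        for order in orders:
            yield key, user, order


# Illustrative sample data (assumed, not from the course).
lines = ["big data is big", "data science"]
word_counts = map_reduce(lines, wc_mapper, wc_reducer)
print(word_counts)

rows = [
    ("users", 1, "ann"),
    ("users", 2, "bob"),
    ("orders", 1, "book"),
    ("orders", 1, "pen"),
]
joined = map_reduce(rows, join_mapper, join_reducer)
print(joined)
```

In a real Hadoop job the mappers and reducers would run in parallel across many machines and the framework would perform the shuffle. The point of the sketch is only that the mapper and reducer functions themselves stay this small, which is why such simple-looking examples can scale.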

Although the course has closed, all lecture videos can still be accessed via the “Preview” button on the course page at the above link.

The videos are definitely worth watching if you want to get a feel for MapReduce and Hadoop.

