Taking R to the Limit, Part I – Parallelization in R

July 28, 2010

(This article was first published on Byte Mining, and kindly contributed to R-bloggers)


Tuesday night I had the opportunity to present on high performance computing in R, and the Los Angeles R Users’ Group. There was so much to talk about that I had to split my talk into two parts. The first part was parallelization and the second part will be big data (and a bit left over from parallelization including Hadoop).

My slides are posted on SlideShare, and available for download here.

The corresponding demonstration code is here.

Topics included:

  • Rmpi
  • snow
  • snowfall and sfCluster
  • multicore
  • foreach
  • brief mention of CUDA and GPUs

Video of the presentation with my commentary:

The video was created with Vara ScreenFlow and I am very happy with how easy it is to use and how painless editing was.

For Part 2, Large Datasets in R, click here.

To leave a comment for the author, please follow the link and comment on their blog: Byte Mining.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...


Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)