Taking R to the Limit, Part I – Parallelization in R

July 28, 2010
By

(This article was first published on Byte Mining, and kindly contributed to R-bloggers)

Rlogo

Tuesday night I had the opportunity to present on high performance computing in R, and the Los Angeles R Users’ Group. There was so much to talk about that I had to split my talk into two parts. The first part was parallelization and the second part will be big data (and a bit left over from parallelization including Hadoop).

My slides are posted on SlideShare, and available for download here.

The corresponding demonstration code is here.

Topics included:

  • Rmpi
  • snow
  • snowfall and sfCluster
  • multicore
  • foreach
  • brief mention of CUDA and GPUs

Video of the presentation with my commentary:


The video was created with Vara ScreenFlow and I am very happy with how easy it is to use and how painless editing was.

For Part 2, Large Datasets in R, click here.

To leave a comment for the author, please follow the link and comment on his blog: Byte Mining.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Tags:

Comments are closed.