Setting up AWS Cluster to use snow in R

November 8, 2011
By

(This article was first published on indiacrunchin » R, and kindly contributed to R-bloggers)

Setting up AWS Cluster

I wanted to setup an AWS cluster to take a shot at a Kaggle contest – DunnHumby Challenge

http://www.kaggle.com/c/dunnhumbychallenge

For this, I found StarCluster to be of great help. It allows you to set-up AWS nodes in a few lines of code and does much more (choosing AMIs and cluster configurations)

http://web.mit.edu/stardev/cluster/

Make sure you use the Bioconductor AMI which comes bundled with R and a host of installed packages.

http://www.bioconductor.org/help/bioconductor-cloud-ami/

I used the package “snowfall” for parallel processing.

Relevant SO questions I had asked

http://stackoverflow.com/questions/7241244/using-aws-for-parallel-processing-with-r

http://stackoverflow.com/questions/7333801/using-snow-and-snowfall-with-aws-for-parallel-processing-in-r


To leave a comment for the author, please follow the link and comment on his blog: indiacrunchin » R.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.