Setting up AWS Cluster to use snow in R

[This article was first published on indiacrunchin » R, and kindly contributed to R-bloggers.]

Setting up AWS Cluster

I wanted to set up an AWS cluster to take a shot at a Kaggle contest, the DunnHumby Challenge:

http://www.kaggle.com/c/dunnhumbychallenge

For this, I found StarCluster to be of great help. It lets you set up AWS nodes in a few lines of code and handles much more, such as choosing AMIs and cluster configurations.

http://web.mit.edu/stardev/cluster/

Make sure you use the Bioconductor AMI, which comes bundled with R and a host of preinstalled packages.

http://www.bioconductor.org/help/bioconductor-cloud-ami/

I used the “snowfall” package, a wrapper around snow, for parallel processing.
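For readers who have not used snowfall before, here is a minimal sketch of what a parallel run looks like. It is not the code I used for the contest: the `square` function and the input `1:10` are made up for illustration, and the sketch starts two worker processes on the local machine so that it runs anywhere. On a real cluster you would instead pass the node hostnames to `sfInit()` through its `socketHosts` argument.

```r
library(snowfall)

# Assumption: two local socket workers, so the sketch runs on a single machine.
# On a cluster, sfInit() also accepts socketHosts = c("host1", "host2", ...).
sfInit(parallel = TRUE, cpus = 2, type = "SOCK")

# Hypothetical task for illustration: square each number.
square <- function(x) x^2

# sfSapply() is the parallel counterpart of sapply(); the work is split
# across the workers started by sfInit().
results <- sfSapply(1:10, square)

# Always shut the workers down when finished.
sfStop()

print(results)
```

If the function needs packages or variables from your session, `sfLibrary()` and `sfExport()` make them available on the workers before the parallel call.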

Relevant Stack Overflow questions I asked:

http://stackoverflow.com/questions/7241244/using-aws-for-parallel-processing-with-r

http://stackoverflow.com/questions/7333801/using-snow-and-snowfall-with-aws-for-parallel-processing-in-r

