(This article was first published on **indiacrunchin » R**, and kindly contributed to R-bloggers)

Setting up AWS Cluster

I wanted to set up an AWS cluster to take a shot at a Kaggle contest, the DunnHumby Challenge:

http://www.kaggle.com/c/dunnhumbychallenge

For this, I found StarCluster to be of great help. It lets you set up AWS nodes in a few lines of code and does much more, such as letting you choose AMIs and cluster configurations.

http://web.mit.edu/stardev/cluster/

Make sure you use the Bioconductor AMI, which comes bundled with R and a host of preinstalled packages.

http://www.bioconductor.org/help/bioconductor-cloud-ami/

I used the package “snowfall” for parallel processing.
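As a rough illustration of what parallel processing with snowfall can look like, here is a minimal sketch. It is not the code used for the contest: `score_chunk` is a hypothetical placeholder for the real per-task work, and the example starts two local socket workers so it can run on a single machine. On an actual cluster you would presumably pass the node hostnames to `sfInit()` through its `socketHosts` argument instead.

```r
library(snowfall)

# Hypothetical per-task function: a stand-in for whatever computation
# each worker should perform (not taken from the original post).
score_chunk <- function(i) {
  sum(sqrt(seq_len(i)))
}

# Start a socket cluster with 2 local workers (assumption: local test run).
# On a multi-node cluster, socketHosts would list the node hostnames.
sfInit(parallel = TRUE, cpus = 2, type = "SOCK")

# Make the function available on every worker.
sfExport("score_chunk")

# Parallel equivalent of lapply(): each element of 1:8 is handled by a worker.
results <- sfLapply(1:8, score_chunk)

# Shut the workers down once the work is finished.
sfStop()

print(unlist(results))
```

`sfLapply()` is a drop-in replacement for `lapply()`, so code that already works sequentially can usually be moved to the cluster by changing only that call and exporting the objects the workers need.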

A related Stack Overflow question I asked:

http://stackoverflow.com/questions/7241244/using-aws-for-parallel-processing-with-r
