If you're looking to get started with data science in R, a great place to begin is OnePageR by Graham Williams. (Graham is the creator of Rattle, author of Data Mining with Rattle and R, and Director of Data Science at Microsoft.) This free (CC-licensed) resource is a series of hands-on mini-chapters with associated R code, organized into four main topic areas:
- Data Science: introductions to data science, data mining, literate programming, and the R language
- Dealing with Data: reading data files and open access data, basic explorations and visualizations, and two case studies
- Building Models: tutorials for many kinds of models, including association analysis, ensemble models, and multivariate adaptive regression splines
- Advanced R and Analytics: topics including writing functions, parallel processing, and text mining
OnePageR is a continual work in progress and is regularly updated to incorporate advances in R and the R package ecosystem. To download the materials, follow the link below.