An R "meta" book

March 13, 2014
By

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

by Joseph Rickert

I am a book person. I collect books on all sorts of subjects that interest me and consequently I have a fairly extensive collection of R books, many of which I find to be of great value. Nevertheless, when I am asked to recommend an R book to someone new to R I am usually flummoxed. R is growing at a fantastic rate, and people coming to R for the first time span I wide range of sophistication. And besides, owning a book is kind of personal. It is one thing to go out and buy a technical book because it is required for a course, but quite an other to make a commitment to a book all on your own. Not only must it have the right content, at the right level for you, and be written in a way that you will actually read it, a book must feel right, be typeset to appeal to your sense of aesthetics, have diagrams and illustrations to draw you in, and contain enough white space to seem approachable. Furthermore, there is a burden to owning a book. There is nothing worse than making a poor selection and having a totally incomprehensible text stare at you from a shelf. Moreover, even an old friend can impose obligations. I have read deeply from The Elements of Statistical Learning, but not everything, so there it sits: admonishing me.  

Recently, however, while crawling around CRAN, it occurred to me that there is a tremendous amount of high quality material on a wide range of topics in the Contributed Documentation page that would make a perfect introduction to all sorts of people coming to R. Maybe, all it needs is a little marketing and reorganization. So, from among this treasure cache (and a few other online sources), I have assembled an R “meta” book in the following table  that might be called: An R Based Introduction to Probability and Statistics with Applications.

 

Content

Document

Author

1

Basic Probability and Statistics

Introduction to Probability and Statistics Using R

G. Jay Kerns

2

Fitting Probability Distributions

Fitting Distributions with R

Vito Ricci

3

  • Regression
    • Inference
    • Diagnostics
    • Stepwise Regression
    • Ridge Regression
  • ANOVA

Practical Regression and Anova using R

Julian J. Faraway

4

Experimental Design

An R companion to Experimental Design

Vikneswaran

5

Survival Analysis

Cox Proportional-Hazards Regression for Survival Data

John Fox

6

Generalized Linear Models

Analysis of epidemiological data using R and Epicalc

Virasakdi Chongsuvivatwong

7

  • Bootstrap
  • Hierarchical Models
  • Nonlinear Mixed Effects

icebreakeR

Andrew Robinson

8

Time Series

Time Series Analysis with R

McLeod, Yu and Mahdi

9

  • Bayesian Statistics
  • Gibbs Sampler

Statistics Using R with Biological Examples

Kim Seefeld and Ernst Linder

10

Machine Learning

  • Decision Trees and Random Forest
  • Clustering
  • Outlier Detection
  • Time Series Analysis and Mining
  • Text Mining
  • Social Network Analysis

R and Data Mining: Examples and Case Studies

Yanchang Zhao

11

Bioinformatics

  • Cluster Analysis
  • Classification Methods
  • Markov Models
  • Micro Array Analysis

Applied Statistics for Bioinformatics using R

Wim P. Krijnen

12

Forecasting

Forecasting: principles and practice

Hyndman and Athanasopoulos

13

Structural Equation Models

Structural Equation Models

John Fox

14

Credit Scoring

Guide to Credit Scoring in R

Dhruv Sharma

The content column lists the topics that I think ought to be included in a good introductory probability and statistics textbook. With a little searching, you will be able to find a discussion of each topic in the document listed to its right. Obviously, there is a lot overlap among the documents listed, since most of them are substantial works that cover much more than the few topics that I have listed.

Finally, I don’t mean to imply that the documents in my table are the best assembled in the Contributed Documentation page. The table just represents my idiosyncratic way of organizing some of the material in a way that I hope newcomers will find useful. I think that collectively the contributed documents have everything one might look for in a first date with R. They are available, approachable, contain superb content written by R experts, and are replete with examples and R code. And, with a little effort, a casual first encounter could lead to long term relationship.

 

To leave a comment for the author, please follow the link and comment on his blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.