In case you missed them, here are some articles from January of particular interest to R users:
Princeton’s Germán Rodríguez has published a useful “Introduction to R” guide, with a focus on linear and logistic regression.
The rxDForest function in the RevoScaleR package fits random forests of histogram-binning trees.
In a video interview with Trevor Hastie, John Chambers recounts the history of S and R.
A review of “Doing Data Science”, a new book by Rachel Schutt and Cathy O'Neil.
Hadley Wickham introduces the dplyr package, with its “grammar of data manipulation”.
The new choroplethr package makes it easier to create data maps in R.
A developer preview of SparkR, an interface between R and Apache Spark, is now available.
Joseph Rickert reviews the capabilities of R for topological data analysis.
In a recent survey of data scientists, R is the most-used software tool other than SQL.
A new JSS article on computing with massive data, and a change in policy for acceptable JSS software licenses.
Large scale optimization with the optim and rxDataStep functions.
A preview of a forthcoming update to Max Kuhn’s caret package, and an interactive chart showing similarities amongst the 143 statistical and machine learning models it supports.
Simulation-based forecasts by combining expert opinion.
Tips on getting R help.
The top 10 posts on the Revolutions blog from 2013.
Some non-R stories in the past month included: levitation with sound waves, the NYT’s 4th Down Bot, bird flight patterns, camera magic on Vine, visualizing whisky flavor profiles, the Cornsweet illusion, and a one-sentence explanation of the Fourier Transform.
As always, thanks for the comments and please send any suggestions to me at [email protected]. Don't forget you can follow the blog using an RSS reader, via email using blogtrottr, or by following me on Twitter (I'm @revodavid). You can find roundups of previous months here.