Blog Archives

R is 15th of top programming languages in latest RedMonk ranking

February 3, 2014

Analyst firm RedMonk periodically publishes rankings of the Top 20 programming languages, as measured by activity on Stack Overflow and number of repositories on GitHub. In their most recent ranking (January 2014), R is ranked #15 amongst all programming languages. That's an impressive ranking for a domain-specific language; the top 3 were the general-purpose languages Java, JavaScript and PHP. Here's a...

Princeton’s guide to linear modeling and logistic regression with R

January 31, 2014

If you're new to the R language but keen to get started with linear modeling or logistic regression in the language, take a look at this "Introduction to R" PDF, by Princeton's Germán Rodríguez. (There's also a browsable HTML version.) In a crisp 35 pages it begins by taking you through the basics of R: simple objects, importing data,...

NYT’s 4th Down Bot gives the Super Bowl edge to the Broncos

January 29, 2014

Who will win the Super Bowl this Sunday: Seattle or Denver? As pundits around the country weigh in with their predictions, you might want to check out the analysis from the New York Times' 4th Down Bot, which compares the coaches' calls on fourth down plays with what historical statistics and a point-forecasting model indicate would have been the ideal...

John Chambers recounts the history of S and R

January 27, 2014

"R has had a revolutionary effect on the way statistics are communicated." So says John Chambers, a member of the R-core team overseeing R and co-inventor of the S language. In this interview with Trevor Hastie (his co-author on Statistical Models in S), John Chambers recounts his involvement in the birth of the S language in 1976,...

Demo this Wednesday: Drag-and-drop to create R-based workflows

January 24, 2014

Want to see how you can use a drag-and-drop user interface to run and share R code? Check out our webinar next Wednesday, January 29 (hosted by Alteryx and Revolution Analytics): Creating Value That Scales with Revolution Analytics & Alteryx. In the webinar, Dan Putler (Alteryx's Data Artisan in Residence) will demonstrate the Alteryx drag-and-drop GUI, which provides...

Fast and easy data munging, with dplyr

January 22, 2014

RStudio's Hadley Wickham has just introduced a new package for filtering, selecting, restructuring and aggregating tabular data in R: the dplyr package. It's similar in concept to Hadley's original plyr package from 2009, but with several key improvements: it works exclusively with data in R data frames; it can process data in remote databases (with the transformations done in-database...

Easy data maps with R: the choroplethr package

January 21, 2014

Choropleth maps are a popular way of representing spatial or geographic data, where a statistic of interest (say, income, voting results or crime rate) is color-coded by region. R includes all of the necessary tools for creating choropleth maps, but Trulia's Ari Lamstein has made the process even easier with the new choroplethr package now available on GitHub. With...

In case you missed it: December 2013 Roundup

January 17, 2014

In case you missed them, here are some articles from December of particular interest to R users: A ComputerWorld tutorial on basic data processing with R. Prediction: R will replace legacy SAS solutions and go mainstream. A chart of the growth of R user groups and local R meetings. I discussed R, data science and big data in an...

In data scientist survey, R is the most-used tool (other than databases)

January 15, 2014

O'Reilly has just published the results of the Data Scientist Salary Survey, based on data collected from attendees of the O'Reilly Strata conferences in 2012 and 2013. There were some interesting results from the salary portion of the survey: data scientists at early-stage startups earned a median salary of US$130,000; data scientists at public companies earned a higher median...

Where the whisky flavor profile data came from

January 14, 2014

Our crack-shot R trainer Luba Gloukhov generated a spirited (pun intended!) discussion from her post K-means Clustering 86 Single Malt Scotch Whiskies, with mentions of her analysis at FlowingData and Reddit amongst others. Other bloggers took a look at the data too, notably Christopher Ingraham who created this beautiful infographic of the flavour profiles of the 86 whiskies from...
