Monthly Archives: June 2013

R language skills: standard and necessary in today’s world

June 21, 2013
By

A recent Business Times article on Singapore's push to become a tech leader mentions Revolution Analytics new Center of Excellence, set up with the support of the Singapore government to train and grow a pool of data scientists and developers in data science. It includes this quote from SAP: "This will ensure we are equipping our workforce with the...

Read more »

Statistical models are stories about how the data came to be

June 21, 2013
By

And in much of Statistics, the way of telling such stories is through maximum likelihood: given a multitude of possible stories (models), which story is most consistent with the data we actually saw? Dave Harris originates the lovely aphorism above in...

Read more »

Job openings at conservative political analytics firm!

June 21, 2013
By
Job openings at conservative political analytics firm!

After posting that announcement about Civis Analytics, I wrote, “If a reconstituted Romney Analytics team is hiring, let me know and I’ll post that ad too.” Adam Schaeffer obliged: Not sure about Romney’s team, but Evolving Strategies is looking for sharp folks who lean right: Evolving Strategies is a political communications research firm specializing in The post Job...

Read more »

Disposable Visual Data Explorers with Shiny – Guardian University Tables 2014

June 21, 2013
By
Disposable Visual Data Explorers with Shiny – Guardian University Tables 2014

Have data – now what? Building your own interactive data explorer need not be a chore with the R shiny library… Here’s a quick walkthrough… In Datagrabbing Commonly Formatted Sheets from a Google Spreadsheet – Guardian 2014 University Guide Data, I showed how to grab some data from several dozen commonly formatted sheets in a

Read more »

ggplot Tutorial

June 21, 2013
By
ggplot Tutorial

ggplot Tutorial I liked the following ggplot2 tutorial which is featured in Gabriela de Queiroz’s blog called unbiasedestimator. The tutorial looks very neatly presented and I’m sure that it will be very helpful to anyone just getting started with ggplot2 before they jump into ggplot2: Elegant Graphics for Data Analysis by Hadley Wickham or R Graphics Cookbook by...

Read more »

Put some cushions on the sofa

June 21, 2013
By

I posted earlier this week about sofa (here), introducing a package I started recently that interacts with CouchDB from R. There's been a fair amount of response at least in terms of page views, so I'll take that as a sign to keep going. One thing that would be nice while you are CouchDB-ing is to interact with local...

Read more »

Put some cushions on the sofa

June 21, 2013
By

I posted earlier this week about sofa (here), introducing a package I started recently that interacts with CouchDB from R. There's been a fair amount of response at least in terms of page views, so I'll take that as a sign to keep going. One thing that would be nice while you are CouchDB-ing is to interact with local...

Read more »

The PISA2009lite package is released

June 20, 2013
By
The PISA2009lite package is released

This post introduces a new R package named PISA2009lite. I will show how to install this package, what is inside and how to use it. Introduction PISA (Programme for International Student Assessment) is a worldwide study focused on measuring performance of 15-year-old school pupils. More precisely, scholastic performance on mathematics, science and reading is measured

Read more »

Measuring Associations

June 20, 2013
By
Measuring Associations

In Chapter 18, we discuss a relatively new method for measuring predictor importance called the maximal information coefficient (MIC). The original paper is by Reshef at al (2011). A summary of the initial reactions to the MIC are Speed and Tibshirani (and others can be found here). My (minor) beef with it is the lack...

Read more »

Upcoming Rcpp talk in Sydney

June 20, 2013
By

The Sydney Users of R Forum (SURF) will be hosting me for a talk on July 10. The focus will be Rcpp for R and C++ integration, and the intent is to have this be really applied with lots of motivating examples. Organizers Louise and Eugene were able...

Read more »