Monthly Archives: June 2013

R and MongoDB

June 7, 2013
By
R and MongoDB

MongoDB is a document-based noSQL database. Different from the relational database storing data in tables with rigid schemas, MongoDB stores data in documents with dynamic schemas. In the demonstration below, I am going to show how to extract data from a MongoDB with R. Before starting the R session, we need to install the MongoDB

Read more »

Hey, I Just did a Significance Test!

June 7, 2013
By
Hey, I Just did a Significance Test!

I’ve seen it happens quite often. The sig test. Somebody simply needs to know the p-value and that one number will provide all of the information about the study that they need to know. The dataset is presented and the client/boss/colleague/etc invariably asks the question “is it significant?” and “what’s the correlation?”. To quote R.A.

Read more »

Robust logistic regression

June 7, 2013
By

Corey Yanofsky writes: In your work, you’ve robustificated logistic regression by having the logit function saturate at, e.g., 0.01 and 0.99, instead of 0 and 1. Do you have any thoughts on a sensible setting for the saturation values? My intuition suggests that it has something to do with proportion of outliers expected in the The post Robust...

Read more »

Crayfish or crawdad? Mapping US dialect variations with R

June 7, 2013
By
Crayfish or crawdad? Mapping US dialect variations with R

I grew up in Australia, where I learned to speak English. Or so I thought: when I moved overseas to the UK, and especially when I moved to the States, I soon learned these are distinct cultures separated by a common language. Words which I previously had no context for being different anywhere else, such as "runners" ("sneakers"), "lemonade"...

Read more »

The Rcpp Book is now shipping

My book about Rcpp (and its R and C++ integration) is now available from Springer.Amazon still lists it as not-yet-released; I expect this to change in the next few days.

Read more »

Happy Birthday rasterVis!

Happy Birthday rasterVis!

Two years ago the first version of rasterVis was submitted to R-Forge and some weeks after the first stable version was …Continuar leyendo »

Read more »

A Shiny App Goes Viral

June 7, 2013
By
A Shiny App Goes Viral

I am not sure how many of you have seen this Business Insider article.  It is basically about a shiny app created by Joshua Katz as NC State.  It is really fun playing with shiny app.With nearly a million facebook likes this web app buil...

Read more »

Income Distribution in London

June 7, 2013
By
Income Distribution in London

Inspired by the Institute of Fiscal Studies' "Where do you fit in" application, where people can find out their position in the UK's income distribution, I wanted to find out how the picture in London looks like. Quite different. If you are in a very high percentile nationwide, high incomes of mainly financial sector employees in London...

Read more »

Symmetric set differences in R

June 7, 2013
By

My .Rprofile contains a collection of convenience functions and function abbreviations. These are either functions I use dozens of times a day and prefer not to type in full:## my abbreviation of head() h Or problems that I'd rather figure out once, and only once:## example: ## between( 1:10, 5.5, 6.5 ) between = low & x low & x...

Read more »

Comrades Marathon Attrition Rate

June 7, 2013
By
Comrades Marathon Attrition Rate

It is a bit of a mission to get the complete data set for this year’s Comrades Marathon. The full results are easily accessible, but come as an HTML file. Embedded in this file are links to the splits for individual athletes. So with a bit of scripting wizardry it is also possible to download

Read more »