Monthly Archives: March 2011

R/Finance 2011 Registration Open

March 14, 2011
By

The registration for R/Finance 2011--which will take place April 29 and 30 in Chicago--is NOW OPEN!Building on the success of the two previous conferences in 2009 and 2010, we are expecting more than 250 attendees from around the world representing bot...

Read more »

Amanda Cox on How The New York Times Graphics Department Uses R

March 14, 2011
By

Last month, Amanda Cox from The New York Times Graphic Department gave a great talk to the NYC R Statistical Programming Meetup. I’ve just got around to uploading the video, which has been broken into a part one and part two. You can also view the videos embedded after the jump. Amanda made use of

Read more »

Language used by Academics with the Protection of Anonymity

March 14, 2011
By
Language used by Academics with the Protection of Anonymity

Those in the political science discipline probably remember their first encounter with poliscijobrumors.com. For those outside, you have probably never heard of this particular message board, and you would have no reason to. As the URL suggests, the board specializes in rumor, gossip, back-bitting, mudslinging, and the occasional lucid thread on the political science

Read more »

R 2.13.0 scheduled for April 13

March 14, 2011
By

As announced yesterday by the R Core Team, the next major update to R will be released on April 13. R 2.13.0 is the next major release of R, which gets major updates approximately every six months. This also indicates that R 2.12.2 is the last patch level of the R 2.12 series, and so the next version of...

Read more »

R Tutorial Series: Applying the Reshape Package to Organize ANOVA Data

March 14, 2011
By
R Tutorial Series: Applying the Reshape Package to Organize ANOVA Data

As demonstrated in the preceding ANOVA tutorials, data organization is central to conducting ANOVA in R. In standard ANOVA, we used the tapply() function to generate a table for a single summary function. In repeated measures ANOVA, we used separate da...

Read more »

Hacker News Analysis

March 13, 2011
By
Hacker News Analysis

I was playing around with the Hacker News database Ronnie Roller made (thanks!), so I thought I’d post some of my findings. Activity on the Site My first question was: how has activity on the site increased over time? I … Continue reading →

Read more »

Piiikaaachuuuuuu vs. KHAAAAAN!

March 13, 2011
By
Piiikaaachuuuuuu vs. KHAAAAAN!

This is a fun image I found on Neil Kodner’s blog: But I’ve never actually watched any of the Star Trek movies, so I decided to recreate the graph with Pikachu instead: Here’s a smoothed version to better compare the counts … Continue reading →

Read more »

A Kernel Density Approach to Outlier Detection

March 13, 2011
By
A Kernel Density Approach to Outlier Detection

I describe a kernel density approach to outlier detection on small datasets. In particular, my model is the set of prices for a given item that can be found online. Introduction Suppose you’re searching online for the cheapest place to … Continue reading →

Read more »

Eigensheep

March 13, 2011
By
Eigensheep

Aaron Koblin’s Sheep Market visualization is an awesome use of Mechanical Turk. But it’d be even more awesome if the grid were ordered, so inspired by the use of eigenfaces in facial recognition, I decided to try projecting the sheep … Continue reading →

Read more »

Counting Clusters

March 13, 2011
By
Counting Clusters

Given a set of numerical datapoints, we often want to know how many clusters the datapoints form. Two practical algorithms for determining the number of clusters are the gap statistic and the prediction strength. Gap Statistic The gap statistic algorithm … Continue reading →

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)