Posts Tagged ‘ R Stuff ’

R at Wikimedia

July 9, 2011
By

Last year Wikipedia rolled out a pilot program to use Wikipedia article creation as an assignment in the classroom. Students wrote articles on a topic area and rather than turning them into a professor and forgetting about it they upload it to Wikipedia and expose it to readers around the world. 24 schools inside the…

Read more »

Wikipedia for Kaggle Participants

July 1, 2011
By

Kaggle has released a new data-mining challenge: use data from 10 years of Wikipedia edits in order to predict future edit rates. The dataset has been anonymized in order to obscure editor identity and article identity, simultaneously adding focus to the challenge and robbing the dataset of considerable richness. I have some experience with wikipedia…

Read more »

No, steal *this* blog!

June 23, 2011
By

Should the world of R Wikibooks require a precis on truncated distributions, marijuana prices, or an obtusely coded method to visualize orthogonal least squares (spoiler alert: dot products are easier!), I’m your man! I hereby release all of the content and code in the R stuff category under the cc-by-sa copyright license. You can do…

Read more »

Resources for Learning R

May 17, 2011
By

The information below will be periodically updated at the folowing permanent link: http://www.backsidesmack.com/r-resources/ Searching for information on R sucks. Not only is the language name a letter of the alphabet (an ignominy it shares with C and some less well known languages), there is Pearson’s r and the coefficient of determination, R squared! if you…

Read more »

Because it’s Friday: French Press Heat Retention

May 13, 2011
By
Because it’s Friday: French Press Heat Retention

While responding to this thread on Reddit I made a rough guess as to the heat retention of my french press when completely full of coffee. When I went to bed I realized there was no good reason why I … Continue reading →

Read more »

Late to the party for R in Finance blogging

May 10, 2011
By
Late to the party for R in Finance blogging

I meant to blog about the R/Finance conference during a lull, but I didn’t find too many. Unlike many conferences I’ve been to the structure of R/Finance was simple: one room and one speaker at a time. Relying on each … Continue reading →

Read more »

What value is cross country GDP correlation? [Part One]

May 6, 2011
By
What value is cross country GDP correlation? [Part One]

The above graph borders on chartjunk (and is nothing like Paul Butler’s amazing Facebook map). We can see some variation in color but mostly it is a set of lines between 152 country capitals with no means to determine which … Continue reading →

Read more »

Measuring Price Dispersion of Marijuana

April 12, 2011
By
Measuring Price Dispersion of Marijuana

The intersection of mapping APIs, fast database operations and user engagement offers a lot of very cool crowdsourcing applications ranging from the benign and powerful (Google’s Person Finder) to the minor and questionable (A DUI checkpoints app). Most intriguing in … Continue reading →

Read more »

A Simple Instrumental Variables Problem

March 29, 2011
By
A Simple Instrumental Variables Problem

When working with labor economics, we often run into issues with selection on variables of interest. Regressing earnings on years of education to estimate the human capital earnings function makes sense at first blush until we imagine that education is … Continue reading →

Read more »

Radiation levels at Fukushima

March 24, 2011
By
Radiation levels at Fukushima

From BWR The above graph is derived from data scraped from TEPCO press releases. Every hour or so for the first few days of the crisis, a TEPCO van would record radiation (probably Beta/Gamma, but the translation is unclear) at … Continue reading →

Read more »