Here's another great example of R being used to analyze sports data. Statistician and skier Joran Elias has started a project to analyze and visualize international cross country ski racing results, and he publishes his analysis at the blog Statistical Skier. All of the analyses are done using R (and for data, SQLite via the RSQLite package). As much as I love to ski, I'm not so familiar with the intricacies of ski racing as a sport, but it looks like there's a rich source of data ripe for analysis, and Joran is pulling out some interesting results. For example, here's the recent history of one recently-retired five-time Olympian Sabina Valbusa:
I can only assume a low "FIS Points" is better: Joran notes "best season overall might have been 2002-2003". A great use of ggplot2 and smoothing to visualize these data. Check out the blog for some other analyses -- especially if you're a ski junkie.
Statistical Skier: Data based analysis and commentary on nordic skiing
To leave a comment
for the author, please follow the link and comment on his blog: Revolutions
offers daily e-mail updates
news and tutorials
on topics such as: visualization (ggplot2
), programming (RStudio
, Web Scraping
) statistics (regression
, time series
) and more...
If you got this far, why not subscribe for updates
from the site? Choose your flavor: e-mail
, or facebook