After my recent posts fiddling around with heat maps for pitch location, Jason at It's About the Money, Stupid contacted me to ask if I would contribute some location maps for Yankee pitchers. Obviously, I couldn't pass up the chance to contribute to ...
As this year's admin, I wrote up the following summary which has now been
posted at the R site in the appropriate slot. My thanks to
this year's students, fellow mentors and everybody else who helped to make it
happen.
Projects 2010
As in 2008 and
2009,
the R Project has again participated ... [Read more...]
Last week, George Casella and I worked around the clock on starting the third edition of Monte Carlo Statistical Methods by detailing the changes to make and designing the new table of contents. The new edition will not see a revolution in the presentation of the material but rather a ...
Just announced: World Bank Data features and data are available. Previous posts have demonstrated how to access and plot this data using R (including the use of the R WDI package). The chart above can be created using the following pr... [Read more...]
R is designed to make it easy to clearly express statistical ideas in code, but when it come to writing code that runs as fast as possible, there are a few tips, tricks and caveats to be aware of. As part of the BioConductor conference this past summer, Martin Morgan ... [Read more...]
HOMOPHILY + MAPS WITHOUT MAPPING SOFTWARE In the past, Decision Science News has posted about homophily (“birds of a feather shop together“) and cool, lightweight visualizations (“maps without map packages in R“). Today, both topics come together in Eric Fischer’s fascinating set of images on Flickr called “Race and Ethnicity”(*). ... [Read more...]
A few months ago I switched my laptop from Windows to Ubuntu Linux. I had been connecting to my corporate SQL Server database using RODBC on Windows so I attempted to get ODBC connectivity up and running on Ubuntu. ODBC on Ubuntu turned into an exercise in futility. I spent ... [Read more...]
I compared the results of my fantasy football draft with the results of more than 1500 mock drafts at the Fantasy Football Calculator (FFC). I looked at where player X was drafted in our league, subtracted off the average draft … Continue reading → [Read more...]
The R Function of the Day series will focus on describing in plain language how certain R functions work, focusing on simple examples
that you can apply to gain insight into your own data.
Today, I will discuss the foodweb function, found in... [Read more...]
Higher correlations between oil and equities has been widely discussed in the financial press. Coinciding with the Global Financial Crisis [GFC], correlations in daily returns have risen from their historical +/-0.2 range to over 0.6. Hypotheses for this change vary. Some commentators believe this is evidence of ongoing disruption to markets ... [Read more...]
"The R-Files" is an occasional series from Revolution Analytics, where we profile prominent members of the R Community. Name: Hadley Wickham Profession: Assistant Professor of Statistics, Rice University Nationality: New Zealand Years Using R: 10 Known for: Developing popular R packages including ggplot2, plyr, reshape; creator of crantastic.org; author of ... [Read more...]
In a previous post on classification trees we considered using the tree package to fit a classification tree to data divided into known classes. In this post we will look at the alternative function rpart that is available within the base R distribution.
Fast Tube by Casper
A classification tree ...
How can we change the reference category for a categorical variable? This question comes up often in a consulting practice.When including categorical covariates in regression models, there is a question of how to incorporate the categories. One simpl...
I recently reformatted my laptop and needed to reinstall R and all the packages that I regularly use. In a previous post I covered R Commander, a nice GUI for R that includes a decent data editor and menus for graphics and basic statistical analysis. Since Rcmdr depends on many ... [Read more...]
I gave my introduction to the R course in a crammed amphitheatre of about 200 students today. Had to wear my collectoR teeshirt from Revolution Analytics, even though it only made the kids pay attention for about 30 seconds… The other few “lines” that worked were using the Proctor & Gamble “car 54″ poster ...
This will work for any language you have a syntax file for, but I'll use R as the example.1. Open an R file, and on the plugins menu, click "Word Complete"2. Type a couple of letters e.g. rn and then press ctrl+space (default word suggest keyboard... [Read more...]
Saptarshi Guha (author of the Rhipe package) joins the likes of Ebay, Yahoo, Twitter and Facebook and as one of just 37 presenters at the Hadoop World conference. (Revolution Analytics is proud to sponsor Saptarshi's presence at this event, which take place in New York on October 12.) He'll be talking about ... [Read more...]
Struggling for two whole days to do some not-terribly-advanced repeated-measures analysis of variance in R. Probably my worst experience with R so far and really an example of where R could do with some centralisation of effort. I have explo... [Read more...]