It's not always easy getting the right size. The image is CC by Kristina Gill. A vital part of statistics is producing nice plots, an area where R is outstanding. The graphical ability of R...

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full February edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Case study: Real-Time Marketing Analytics. Online advertising technology company Exelate uses predictive models to...

Add references and a style sheet. Below, I briefly outline why Pandoc is an essential part of my research workflow and demonstrate how to integrate it with a bibliographic system and code written in R to produce high-quality Word or PDF documents. I also...

Exploring the quality of predictions using random portfolios and optimization. Previously “Simple tests of predicted returns” showed a few ways to look at expected returns at the asset level. Here we move to the portfolio level. The previous post focused on correlation. Win Vector Blog points out that gauging prediction quality using correlation can be …

Or, How good were the Maddux/Glavine-era Braves? In this installment of my ongoing series of posts about run scoring in Major League Baseball, I'll turn the equation around and look at runs allowed. In order to account for the changing run scoring environments, the runs allowed by individual teams are compared to the league average for that season,...
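
One way to picture comparing a team's runs allowed to the league average for its season is a simple index. The sketch below is in Python and assumes an index of the form 100 × (team runs allowed per game) / (league runs allowed per game), so that 100 is league average and lower is better. The index form, the function name `runs_allowed_index`, the record layout, and the sample numbers are all illustrative assumptions, not taken from the post.

```python
from collections import defaultdict


def runs_allowed_index(records):
    """Return {(season, team): index}, where 100 means league average.

    `records` is a list of (season, team, runs_allowed, games) tuples.
    This layout is a hypothetical one chosen for the example.
    """
    # Accumulate league-wide runs allowed and games for each season.
    totals = defaultdict(lambda: [0, 0])
    for season, _team, runs_allowed, games in records:
        totals[season][0] += runs_allowed
        totals[season][1] += games

    index = {}
    for season, team, runs_allowed, games in records:
        league_runs, league_games = totals[season]
        league_rate = league_runs / league_games  # league runs allowed per game
        team_rate = runs_allowed / games          # team runs allowed per game
        index[(season, team)] = 100 * team_rate / league_rate
    return index


# Made-up two-team "league" for one season, purely for illustration.
sample = [
    (1, "A", 600, 150),
    (1, "B", 900, 150),
]
idx = runs_allowed_index(sample)
```

Because each season is normalised against its own league rate, teams from high-scoring and low-scoring eras end up on the same scale, which is the point of adjusting for the run scoring environment.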