Blog Archives

Book Review: Analyzing Baseball Data with R

December 17, 2013
By
Book Review: Analyzing Baseball Data with R

by Max Marchi and Jim Albert (2014, CRC Press)The Sabermetric bookshelf, #3Here we have the perfect book for anyone who stumbles across this blog--the intersection of R and baseball data. The open source statistical programming environment of R is a gr...

Read more »

A few random things

December 1, 2013
By

Read more »

Read more »

Fair weather fans, redux

September 1, 2013
By
Fair weather fans, redux

Fair weather fans, redux Or, A little larger small sample On August 11 the Victoria HarbourCats closed out their 2013 West Coast League season with a 4-3 win over the Bellingham Bells. In an earlier...

Read more »

Fair weather fans? (An R scatter plot matrix)

July 18, 2013
By
Fair weather fans? (An R scatter plot matrix)

(This article was first published on Bayes Ball, and kindly contributed to R-bloggers) The Victoria HarbourCats are roughly half way through their inaugural season in the West Coast League, and currently lead the league in average attendance.  In a recent conversation with one of the team's staff, he mentioned that after the first game in early June, the fans...

Read more »

Annotating select points on an X-Y plot using ggplot2

June 16, 2013
By
Annotating select points on an X-Y plot using ggplot2

or, Is the Seattle Mariners outfield a disaster?The BackstoryEarlier this week (2013-06-10), a blog post by Dave Cameron appeared at USS Mariner under the title “Maybe It's Time For Dustin Ackley To Play Some Outfield”. In the first paragraph, Cameron describes to the Seattle Mariners outfield this season as “a complete disaster” and Raul Ibanez as...

Read more »

Major League Baseball run scoring trends with R’s Lahman package

June 4, 2013
By
Major League Baseball run scoring trends with R’s Lahman package

The statistical software R has an ever-expanding array of packages that provide pre-programmed functions and datasets. One such package is named Lahman, bundling the contents of the Lahman database into a quick-and-easy resource for R users. In addition to the data tables, the package resources also contain a variety of analyses and graphics undertaken using...

Read more »

MLB runs allowed by team

February 24, 2013
By

Or, How good were the Maddux/Glavine-era Braves?In this on-going series of posts about run scoring in Major League Baseball, for this installment I'll turn the equation around and look at runs allowed.  In order to account for the changing run scoring environments, the runs allowed by individual teams is compared to the league average for that season,...

Read more »

Run production, one team at a time

February 17, 2013
By

In a previous post, I used R to process data from the Lahman database to calculate index values that compare a team's run production to the league average for that year.  For the purpose of that exercise, I started the sequence at 1947, but for what follows I re-ran the code with the time period...

Read more »

Gist for previous posts

February 17, 2013
By

The more I use it, the more I understand the benefits and value of Github as a code-sharing resource. The gist found here is the R code for my posts on run scoring trends by league (found here, here, and here).  I will continue to use Github for t...

Read more »

Comparing individual team run production

February 3, 2013
By

Or, The 2010 Mariners: How Bad Were They?In earlier posts, I used the statistical software R to plot the trends in league average run scoring since 1901. This was the first step to answering other questions I had on my mind:How poor was the offensive performance of the 2010 Seattle Mariners?Are they showing any signs...

Read more »