Blog Archives

Left-handed catchers

July 23, 2014
By
Left-handed catchers

Benny Distefano – 1985 Donruss #166 (source: baseball-almanac.com) We are approaching the twenty-fifth anniversary of the last time a left-handed throwing catcher appeared behind the plate in a Major League Baseball game; on August 18, 1989 Benny Distefano made his third and final appearance as a catcher for the Pirates. Distefano’s accomplishment was celebrated five years...

Read more »

Book Review: Analyzing Baseball Data with R

December 17, 2013
By
Book Review: Analyzing Baseball Data with R

by Max Marchi and Jim Albert (2014, CRC Press)The Sabermetric bookshelf, #3Here we have the perfect book for anyone who stumbles across this blog--the intersection of R and baseball data. The open source statistical programming environment of R is a gr...

Read more »

A few random things

December 1, 2013
By

Read more »

Read more »

Fair weather fans, redux

September 1, 2013
By
Fair weather fans, redux

Fair weather fans, redux Or, A little larger small sample On August 11 the Victoria HarbourCats closed out their 2013 West Coast League season with a 4-3 win over the Bellingham Bells. In an earlier...

Read more »

Fair weather fans? (An R scatter plot matrix)

July 18, 2013
By
Fair weather fans? (An R scatter plot matrix)

(This article was first published on Bayes Ball, and kindly contributed to R-bloggers) The Victoria HarbourCats are roughly half way through their inaugural season in the West Coast League, and currently lead the league in average attendance.  In a recent conversation with one of the team's staff, he mentioned that after the first game in early June, the fans...

Read more »

Annotating select points on an X-Y plot using ggplot2

June 16, 2013
By
Annotating select points on an X-Y plot using ggplot2

or, Is the Seattle Mariners outfield a disaster?The BackstoryEarlier this week (2013-06-10), a blog post by Dave Cameron appeared at USS Mariner under the title “Maybe It's Time For Dustin Ackley To Play Some Outfield”. In the first paragraph, Cameron describes to the Seattle Mariners outfield this season as “a complete disaster” and Raul Ibanez as...

Read more »

Major League Baseball run scoring trends with R’s Lahman package

June 4, 2013
By
Major League Baseball run scoring trends with R’s Lahman package

The statistical software R has an ever-expanding array of packages that provide pre-programmed functions and datasets. One such package is named Lahman, bundling the contents of the Lahman database into a quick-and-easy resource for R users. In addition to the data tables, the package resources also contain a variety of analyses and graphics undertaken using...

Read more »

MLB runs allowed by team

February 24, 2013
By

Or, How good were the Maddux/Glavine-era Braves?In this on-going series of posts about run scoring in Major League Baseball, for this installment I'll turn the equation around and look at runs allowed.  In order to account for the changing run scoring environments, the runs allowed by individual teams is compared to the league average for that season,...

Read more »

Run production, one team at a time

February 17, 2013
By

In a previous post, I used R to process data from the Lahman database to calculate index values that compare a team's run production to the league average for that year.  For the purpose of that exercise, I started the sequence at 1947, but for what follows I re-ran the code with the time period...

Read more »

Gist for previous posts

February 17, 2013
By

The more I use it, the more I understand the benefits and value of Github as a code-sharing resource. The gist found here is the R code for my posts on run scoring trends by league (found here, here, and here).  I will continue to use Github for t...

Read more »