Major League Baseball Birth Months

[This article was first published on R – Exegetic Analytics, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

The cutoff date for almost all nonschool baseball leagues in the United States is July 31, with the result that more major league players are born in August than in any other month.Malcolm Gladwell, Outliers

A quick analysis to confirm Gladwell’s assertion above. Used data scraped from www.baseball-reference.com. Here’s the evidence:

Distribution of birth months for Major League Baseball players.

Distribution of birth months for Major League Baseball players.

We can make a quick check to see whether the non-uniformity is statistically significant.

> chisq.test(table(baseball$month))

	Chi-squared test for given probabilities

data:  table(baseball$month)
X-squared = 135, df = 11, p-value <2e-16

Yup, it appears to be highly significant.

Obviously the length of the months should make a small difference on the number of births. For example, all else being equal we would expect there to be more births in August (with 31 days) than in July (with only 30 days). We can be a bit more rigorous and take month lengths into account too.

> chisq.test(table(baseball$month), p = month$length / sum(month$length))

	Chi-squared test for given probabilities

data:  table(baseball$month)
X-squared = 115, df = 11, p-value <2e-16

Looks like the outcome is the same: there is a significant non-uniformity in the birth months of Major League Baseball players.

To leave a comment for the author, please follow the link and comment on their blog: R – Exegetic Analytics.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)