Homer, not Bart, is the star of the Simpsons

October 3, 2016

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

It's been a long time since I watched the The Simpsons, but I was always under the impression that Bart was the primary character. Perhaps it was all the Do the Bartman and "Cowabunga!" nonsense from the 90s. Anyway, data scientist Todd W Schneider used R to analyze the scripts of the first 26 seasons and found that Homer speaks twice as much as next most represented character, Marge. Bart comes a close third.


Marge and Lisa are represented in orange (the color of Lisa's dress, in fact) as the only 2 female characters that make the top 10. Female representation isn't much better in the supporting cast either; only 7 characters of the top 60 (12%) are female.

Todd's R code behind the blog post is available on Github (in the analysis folder). Of note to R programmers: Todd used the ggplot2 package to create the charts and created a custom ggplot2 theme for the charts (theme_tws_simpsons) using the Simpsons skin yellow and the Akbar font

For more data analysis of the Simpsons, including a look at the ratings over the last 27 years, check out the Todd's blog bost linked below.

Todd W. Schneider: The Simpsons by the Data (via Jenny Bryan)

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)