Pop quiz! What is this chart saying?

November 8, 2016
(This article was first published on R – AmitKohli.com, and kindly contributed to R-bloggers)

I have been reading more and more about how people can’t interpret charts… which, to be very honest, had never occurred to me. Anyway, it made me want to test people informally and see for myself. So I’ve been doing just that: showing colleagues, friends, etc. a chart with tons of detail that we created interactively during the first Accra R-Users session, and asking them to analyze it at length. The results have been staggering! I’m still trying to generalize my conclusions, but thought it would be fun to open up this test to the community, so here goes! If you feel like sharing, post your observations in the comments section.

“The following chart shows the ratings (IMDb) for ~60k movies throughout the years. Movies are divided by genre (a movie with multiple genres shows up in each of them), and their budgets are shown in color. All points are mostly transparent, so darker patches mean more movies. Talk for 3 minutes about what this chart is showing, try to explain what you see, and think of what other analysis should follow.”

[Figure: movie ratings by year, one panel per genre, points colored by budget]

 

The R code to get this chart follows, or you can find the entire exploratory exercise on the GitHub page.

 

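library(ggplot2)
library(ggplot2movies)  # provides the `movies` data frame
library(tidyr)
library(dplyr)

## Gather the genre indicator columns into one genre column, keep only the rows
## where the movie belongs to that genre, then plot one panel per genre
movies %>%
  select(-(r1:r10)) %>%                                   # drop the rating-distribution columns
  gather(key = "genre", value = "val", Action:Short) %>%  # one row per movie-genre pair
  filter(val == 1) %>%                                    # keep genres the movie belongs to
  ggplot(aes(x = year, y = rating, color = budget, label = title)) +
  geom_point(alpha = 0.1) +
  facet_wrap(~genre) +
  scale_color_gradient(low = "red", high = "green") +
  ggtitle("Movie ratings by year")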
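To see what the reshaping step in the code above does to a movie with several genres, here is a minimal sketch on a made-up toy data frame. The `toy` rows, titles and the three genre columns are assumptions for illustration, not real `movies` data, and it uses `pivot_longer()`, the newer tidyr equivalent of `gather()`.

```r
library(dplyr)
library(tidyr)

# Toy stand-in for `movies` (made-up rows, not real data)
toy <- data.frame(
  title  = c("A", "B", "C"),
  year   = c(1990, 2000, 2005),
  rating = c(6.1, 7.4, 5.0),
  budget = c(NA, 1e6, NA),   # budget is missing for some movies
  Action = c(1, 0, 0),
  Comedy = c(1, 1, 0),
  Drama  = c(0, 1, 0)
)

# Same idea as gather() + filter() above: one row per movie-genre pair,
# keeping only the genres a movie actually belongs to
long <- toy %>%
  pivot_longer(Action:Drama, names_to = "genre", values_to = "val") %>%
  filter(val == 1)

# "A" and "B" each appear twice (once per genre);
# "C" has no genre flag set, so it drops out entirely
count(long, title)

# Share of plotted rows without a budget; these points cannot be placed
# on the red-to-green gradient
mean(is.na(long$budget))
```

Running the same two summaries on the real reshaped data would be a reasonable first follow-up: it tells you how much double counting the panels contain and how many points carry no budget information at all.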

