Site icon R-bloggers

Study of a Plot: The Manhattan Plot

[This article was first published on Daniel, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

I was thinking on a nice way of plotting campaign expenditures in a paper I’m working on. I thought this would be something like the following, simple, but meaningful even in the context of lots of outliers in both tails.

< !--more-->

Although I like the seniors Tukey’s boxplot and scatter plots, I had already used them, so I want to oxygenate my figures. The Manhattan plot is a potential replacement candidate.

The very idea is to have types of elections, districts or parties along the X-axis, with the negative logarithm of the association (p-value) between a candidate’s spending and votes displayed on the Y-axis. Hence, each dot on the plot indicates a candidate’s position on this metric.

Because stronger associations have the smallest p-values (a log of 0.05 = -1.30103), their negative logarithms will be positive and higher (e.g., 1.3), while those with p-values not statistically significant (whatever that means these days, maybe nothing ) will stay below this line.

The positive thing of this version is that it draws our attention to the upper outliers instead to the average association, which tends to be left-skewed because Brazilian elections attract many sacrificial lamb candidates who expend nearly nothing in their campaigns.

To leave a comment for the author, please follow the link and comment on their blog: Daniel.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.