In a recent blog post, I introduced the new R package, manhattanly, which creates interactive manhattan and Q-Q plots using the plotly.js engine. In the latest CRAN release, you can also create volcano plots.
In this post, I describe how to create interactive volcano plots using the manhattanly package. Volcano plots are the negative log10 p-values plotted against their effect size, odds ratio or log fold-change. They are used to identify clinically meaningful markers in genomic experiments, i.e., markers that are statistically significant and have an effect size greater than some threshold.
Visit the package website for full details and example usage.
The following three lines of code will produce the Volcano plot below
install.packages("manhattanly") library(manhattanly) volcanoly(HapMap, snp = "SNP", gene = "GENE")
Notice that we have added two annotations (the SNP and nearest GENE), that are revealed when hovering the mouse over a point. This feature of interactive volcano plots adds a great deal of information to the plot without cluttering it with text.
Inspired by the heatmaply package by Tal Galili, we split the tasks into data pre-processing and plot rendering. Therefore, we can use the
manhattanly::volcanor function to get the data used to produce a volcano plot. This allows flexibility in the rendering of the plot, since any graphics package, such as
plot in base R can make used to create the plot.
The plot data is derived using the
volcanorObject <- volcanor(HapMap, snp = "SNP") str(volcanorObject)
List of 8
$ data :'data.frame': 14412 obs. of 4 variables:
..$ EFFECTSIZE: num [1:14412] -0.0946 -0.0947 -0.0741 0.0146 0.1234 ...
..$ P : num [1:14412] 0.335 0.246 0.823 0.493 0.605 ...
..$ SNP : chr [1:14412] "rs9697358" "rs34945898" "rs12034613" "rs4648633" ...
..$ LOG10P : num [1:14412] 0.4745 0.6093 0.0844 0.307 0.218 ...
$ pName : chr "P"
$ effectName : chr "EFFECTSIZE"
$ xlabel : chr "EFFECTSIZE"
$ snpName : chr "SNP"
$ geneName : logi NA
$ annotation1Name: logi NA
$ annotation2Name: logi NA
- attr(*, "class")= chr "volcanor"
EFFECTSIZE P SNP LOG10P
1 -0.0946 0.3353438 rs9697358 0.47450973
2 -0.0947 0.2458571 rs34945898 0.60931719
3 -0.0741 0.8232859 rs12034613 0.08444933
4 0.0146 0.4932038 rs4648633 0.30697357
5 0.1234 0.6053916 rs4430271 0.21796358
6 0.1979 0.1944431 rs6685625 0.71120743
volcanorObject which is of class
volcanor can also be passed to the
manhattanly::volcanoly function to produce the inteactive volcano plot above:
By default, the points greater than the default
effect_size_line arguments are highlighted. The defaults are
genomewideline = -log10(1e-5) and
effect_size_line = c(-1,1). The
effect_size_line argument must be a numeric vector of length 2 and the first argument must be smaller than the second. To highlight more points, you simply need to change those thresholds. You can set either of the
effect_size_line arguments to
FALSE to remove that threshold:
volcanoly(HapMap, snp = "SNP", genomewideline = -log10(1e-2), effect_size_line = FALSE)
The manhattanly package is based on the qqman package by Stephen Turner. It produces similar manhattan and Q-Q plots as the
qqman::qq functions; the main difference here is being able to interact with the plot, including extra annotation information, seamless integration with HTML and creating interactive volcano plots with automated highlighting of interesting points.