Pathway Commons and paxtoolsr: An Introduction

February 8, 2016

(This article was first published on r – Lunean, and kindly contributed to R-bloggers)

Pathway Commons aggregates public pathway databases and provides infrastructure for querying the combined dataset. The databases in Pathway Commons include: BIND, BioGRID, CORUM, CTD, DIP, DrugBank, HPRD, HumanCyc, IntAct, KEGG, MirTarBase, Panther, PhosphoSitePlus, Reactome, RECON, and TRANSFAC. paxtoolsr is an R package that builds on Pathway Commons and its native BioPAX format. It provides a set of R functions for working with BioPAX OWL files through Paxtools and for querying the Pathway Commons (PC) molecular interaction database, which is hosted by the Computational Biology Center at Memorial Sloan-Kettering Cancer Center (MSKCC).

The accompanying slides give a basic introduction to the Pathway Commons database, including information about the aggregated databases, the number of interactions in Pathway Commons, and the file formats that Pathway Commons provides (e.g. BioPAX, and gene sets as Gene Matrix Transposed (.gmt) files). They also describe how the rich content of these pathway databases is simplified into the Simple Interaction Format, which is suitable for many research problems. Additionally, they provide a simple example of how to use Pathway Commons data to conduct an integrative gene set enrichment analysis with cancer cell line data provided by CellMiner and the rcellminer R package. The example shows how to use a hypergeometric test to perform the enrichment analysis and then apply a multiple testing correction using the widely used Benjamini-Hochberg False Discovery Rate (FDR) method. The elementary steps of this analysis are covered in the Introduction to Statistical Methods in R.
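The enrichment step described above can be sketched in a few lines of base R. This is only an illustration under stated assumptions, not the code from the slides: the gene identifiers, the gene sets, the universe size, and the helper enrich_p are all made up for the example. In a real analysis the gene sets would come from a Pathway Commons .gmt file and the genes of interest from the cell line data.

```r
# Toy data: every name and number here is hypothetical, chosen for illustration.
universe_size <- 1000            # total number of genes measured
hits <- paste0("G", 1:50)        # genes of interest from the experiment

gene_sets <- list(
  set_enriched   = paste0("G", 1:20),     # 20 genes, all among the hits
  set_mixed      = paste0("G", 41:80),    # 40 genes, 10 among the hits
  set_background = paste0("G", 501:540)   # 40 genes, none among the hits
)

# One-sided hypergeometric test: probability of seeing at least the observed
# overlap when drawing length(set) genes at random from the universe.
enrich_p <- function(set, hits, universe_size) {
  overlap <- length(intersect(set, hits))
  phyper(overlap - 1,                      # P(X > overlap - 1) = P(X >= overlap)
         length(hits),                     # "successes" in the universe
         universe_size - length(hits),     # "failures" in the universe
         length(set),                      # number of genes drawn
         lower.tail = FALSE)
}

p_values <- vapply(gene_sets, enrich_p, numeric(1),
                   hits = hits, universe_size = universe_size)

# Benjamini-Hochberg correction across all tested gene sets
fdr <- p.adjust(p_values, method = "BH")

results <- data.frame(
  gene_set = names(gene_sets),
  overlap  = vapply(gene_sets, function(s) length(intersect(s, hits)), integer(1)),
  p_value  = p_values,
  fdr      = fdr
)
print(results[order(results$fdr), ])
```

Subtracting one from the overlap before calling phyper is what turns the upper tail into "at least this many" rather than "more than this many". The adjusted values in the fdr column are the ones to threshold when many gene sets are tested at once.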

Luna A, Babur Ö, Aksoy BA, Demir E, & Sander C (2015). PaxtoolsR: Pathway Analysis in R Using Pathway Commons. Bioinformatics (Oxford, England) PMID: 26685306
