The 5 most downloaded R packages

[This article was first published on DataCamp Blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Curious which R packages your colleagues and the rest of the R community are using? Thanks to Rdocumentation.org you can now see for yourself! Rdocumentation.org aggregates R documentation and download information from popular repositories like CRAN, BioConductor and GitHub. In this post, we’ll take a look at the top 5 R packages with the most direct downloads!*

RDocumentation for R programming users, RStudio users, R package authors and more

Visit Rdocumentation.org

1. dplyr, a grammar of data manipulation 

The R package with the highest number of direct downloads was dplyr, with 98,417 monthly direct downloads. The dplyr package, written by Hadley Wickham, is a fantastic R package for all of your data manipulation tasks! Known as the grammar of data manipulation, dplyr is built around 5 main verbs: select, mutate, filter, arrange and summarize. Beyond the easy syntax, dplyr uses efficient data storage backends, which speeds up your data manipulation tasks. Pretty handy, right? The dplyr package is easy to learn – you can start with this course on DataCamp

2. devtools, a collection of package development tools

The second most popular R package by direct downloads, with 82,962 direct downloads, is devtools. The devtools package, also developed by Hadley Wickham, is a package developer’s best friend. As Hadley explains, “the devtools package makes code sharing easy and it reduces the time and effort required to complete your project by following a template that removes the need to have you think about how to organize things and paves the way for the creation of standardized tools that can further accelerate your progress.” Download devtools and get started making your own packages today! 

3. foreign, read data stored by Minitab, S, SAS, SPSS, Stata, and more

Data comes in all shapes, sizes, and formats. The foreign package, which earned 80,191 direct downloads this month, helps you handle this! This package, developed by the folks at www.r-project.org, includes functions for reading data from Minitab, S, SAS, SPSS, Stata, Systat, Weka, dBase and more. The foreign package is a must download for R users who have colleagues that work with SAS, SPSS, STATA and other software. Don’t let different file formats discourage you, download foreign today! 

4. cluster, methods for cluster analysis

The cluster package, with 75,358 direct downloads, came in fourth this month. The cluster package, maintained by Dr. Martin Mächler, includes a variety of methods for cluster analysis including functions for hierarchical clustering, creating AGNES objects, clustering large applications and more. Check out the cluster package’s page on Rdocumentation.org to learn more and use it for your own R projects!

5. ggplot2, An implementation of the grammar of graphics in R

One of the most popular R packages for data visualization is ggplot2. This useful data visualization package had 72,550 monthly direct downloads. Authored by Hadley Wickham, the ggplot2 package offers an easy, consistent interface for producing plots and charts in R. As mentioned in its documentation, “it combines the advantages of both base and lattice graphics: conditioning and shared axes are handled automatically, and you can still build up a plot step by step from multiple data sources. It also implements a sophisticated multidimensional conditioning system and a consistent interface to map data to aesthetic attributes.” Want to learn for yourself? DataCamp offers 3 full-length courses devoted to ggplot2: 

*Rankings are based on direct downloads, as opposed to indirect downloads related to dependencies. Direct downloads were measured as of 11/04/2016. Please note these numbers can fluctuate daily. 


About RDocumentation  

RDocumentation aggregates help documentation for R packages from CRAN, BioConductor, and GitHub – the three most common sources of current R documentation. RDocumentation.org goes beyond simply aggregating this information, however, by bringing all of this documentation to your fingertips via the RDocumentaion package. The RDocumentation package overwrites the basic help functions from the utils package and gives you access to RDocumentation.org from the comfort of your RStudio IDE. Look up the newest and most popular R packages, search through documentation and post community examples. 

Create an RDocumentation account today!

To leave a comment for the author, please follow the link and comment on their blog: DataCamp Blog.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)