Notable New and Updated R packages (to October 2016)

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

As we prepare for the upcoming release of Microsoft R Open, I've been preparing the list of new and updated packages for the spotlights page. This involves scanning the CRANberries feed (with gracious thanks to Dirk Eddelbuettel) for newly-released packages and significant updates to existing ones.

This is a lot of data to process. For context, in October 2016 alone there were 174 new packages and 629 updated packages. For the 4 months since the last MRO release, I reviewed all of the new packages and some of the updates (those with a 1.0 or 2.0 release) by hand, and selected those that are likely to be of interest to a broad segment of R users. I hope you find some useful nuggets in the list. It's likely I missed some good ones; if you know of any others that should have been included, let me know in the comments.

 

Notable New and Updated R Packages: July — October 2016

Data Munging

anytime: Convert anything that looks vaguely like a date into a POSIXct object. Examples here.

fasttime: Fast functions for timestamp manipulation.

forcats: Tools for working with categorical variables (factors).

roll: Parallel functions for computing rolling statistics of time-series data.

stringb: String handling functions with an interface like stringr/stringi, but without the dependencies.

tidyverse: A collection of popular packages for data munging from Hadley Wickham. More here.

 

Data Sources

countyweather: Meterological time series data for U.S. counties, from NOAA.

darksky: Interface to the Dark Sky API, which provides current or historical global weather conditions.

energyr: Data published by the United States Federal Energy Regulatory Commission.

genderNames: API client for genderize.io which will tell you the gender of the name you input.

Rlinkedin (updated): Provides access to the LinkedIn API for people, job, and discussion data.

outbreaks: A collection of disease outbreak data, some real, some simulated.

rnoaa: Download NOAA weather data. Vignettes provided for many different data types.

rpinterest: Access the Pinterest API for data on boards, pins and users.

ttbbeer: Beer statistics from the U.S. Department of the Treasury, Alcohol and Tobacco Tax and Trade Bureau.

ubeR: Interface to the Uber API.

 

Graphics

cartogram: Create and plot a continuous area cartogram by a rubber sheet distortion algorithm.

colourlovers: Provides access to the COLOURlovers API, which offers color inspiration and color palettes.

earthtones: Derive a color palette from a satellite image of a location on Earth. The vignette includes examples.

Homeric: Doughnut plots – pie charts with a blank center. The package is named after Homer Simpson.

RSVGTipsDevice (updated): An R SVG graphics device with support for dynamic tips and hyperlinks.

tweenr (updated): Create smooth animations by interpolating data.

 

Interfaces

haven (updated): Import and export SPSS, Stata and SAS Files. Details here.

readr (updated): Read flat/tabular text files from disk (or a connection). Overview here.

SnakeCharmR: Run Python code, make function calls, and assign and retrieve variables from R.

sofa: Provides an interface to the NoSQL database CouchDB.

sparklyr: Provision, connect and interface to Apache Spark from within R. Details at spark.rstudio.com.

XR: Support for interfaces from R to other languages, described in the book “Extending R” by John Chambers.

 

Programming Tools

checkpoint (updated): Easily use R packages from a specific date for reproducibility (see the vignette). This update adds the ability to delete snapshots and use a local snapshot repository.  

curl (updated): A modern and flexible web client for R. Vignette included.

curry: Partial function application with %<%, %-<%, and %><%.

jsonlite (updated): fast JSON parser and generator optimized for statistical data and the web.

proto (updated): An object oriented system using object-based rather than class-based object oriented ideas, from Hadley Wickham. Includes a vignette.

Rfast: Fast covariance matrix calculation, Mahalanobis distance and column-wise variances and other fast reimplementations of R functions.

sticky: Allows objects to be marked as 'sticky' and have resilient attributes that persist during many operations.

types: Simple type annotation for R, as a convention to allow other packages to provide error checking, automatic documentation or optimizations.

 

Reporting

bookdown: Output formats and utilities for authoring books with R Markdown.

htmltidy: Clean up gnarly HTML generated by the likes of Google Docs and Microsoft Word.

rmarkdown (updated): Dynamic documents for R. Overview and examples here.

tables (updated): Compute and display complex tables of summary statistics. Examples in the vignette.

tint: Format RMarkdown documents in the style of an Edward Tufte book. Examples here.

 

Statistical Modeling and Analysis

arc: Classification-based on Association Rules (CBA) algorithm for association rule classification (ARC).

brms: Bayesian regression models using Stan. Vignette included.

CADStat: Provides a GUI to several statistical methods including generalized linear regression, conditional probability calculations, and regression trees. The manual provides more details.

endogenous: Classical Simultaneous Equation Models.

ensembleR: Functions to use ensembles of several machine learning models specified in caret package.

MSGARCH: Markov-Switching GARCH Models.

oddsratio: Odds Ratio Calculation for Generalized Additive Models and Generalized Linear Models.

pkmon: Least-squares estimators under k-monotony constraint. Pikachu not included.

radiant: Business analytics using R and Shiny. Overview here.

simputation: Missing value imputation methods that work with the magrittr %>% pipe operator. Vignette included.

spikes: Detect election fraud from irregularities in vote-share distributions.

SurvDisc: Functions for discrete time survival analysis and longitudinal analysis, from the FDA.

 

Utilities

convertr: Convert between a broad range of scientific, historical, and industrial unit types.

digitize: Extract data from images of printed charts.

measurements: Convert between metric and imperial measurement units.

qrencoder: Create scannable QR codes and matrix barcodes.

optimr: A replacement and extension of the 'optim' function for smooth, possibly box constrained functions of several or many parameters.

RDocumentation: Display help files as they appear on RDocumentation.org

recordr: Record data provenance about R script executions (files read/written, packages used, timestamps etc.)

triangulation: Determine the location of an observer based on the angle towards three known points.

 

Other

FuzzyR: Design and simulate fuzzy logic systems using Type 1 Fuzzy Logic.

statquotes: Random quotations on topics in statistics, data visualization and science.

wfindr: Crossword, Scrabble and anagram solver.

 

You can find more notable packages from prior releases of MRO at the Spotlights page on MRAN.

 

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)