Notable New and Updated R packages (to October 2016)

November 17, 2016
By

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

As we prepare for the upcoming release of Microsoft R Open, I've been preparing the list of new and updated packages for the spotlights page. This involves scanning the CRANberries feed (with gracious thanks to Dirk Eddelbuettel) for newly-released packages and significant updates to existing ones.

This is a lot of data to process. For context, in October 2016 alone there were 174 new packages and 629 updated packages. For the 4 months since the last MRO release, I reviewed all of the new packages and some of the updates (those with a 1.0 or 2.0 release) by hand, and selected those that are likely to be of interest to a broad segment of R users. I hope you find some useful nuggets in the list. It's likely I missed some good ones; if you know of any others that should have been included, let me know in the comments.

 

Notable New and Updated R Packages: July — October 2016

Data Munging

anytime: Convert anything that looks vaguely like a date into a POSIXct object. Examples here.

fasttime: Fast functions for timestamp manipulation.

forcats: Tools for working with categorical variables (factors).

roll: Parallel functions for computing rolling statistics of time-series data.

stringb: String handling functions with an interface like stringr/stringi, but without the dependencies.

tidyverse: A collection of popular packages for data munging from Hadley Wickham. More here.

 

Data Sources

countyweather: Meterological time series data for U.S. counties, from NOAA.

darksky: Interface to the Dark Sky API, which provides current or historical global weather conditions.

energyr: Data published by the United States Federal Energy Regulatory Commission.

genderNames: API client for genderize.io which will tell you the gender of the name you input.

Rlinkedin (updated): Provides access to the LinkedIn API for people, job, and discussion data.

outbreaks: A collection of disease outbreak data, some real, some simulated.

rnoaa: Download NOAA weather data. Vignettes provided for many different data types.

rpinterest: Access the Pinterest API for data on boards, pins and users.

ttbbeer: Beer statistics from the U.S. Department of the Treasury, Alcohol and Tobacco Tax and Trade Bureau.

ubeR: Interface to the Uber API.

 

Graphics

cartogram: Create and plot a continuous area cartogram by a rubber sheet distortion algorithm.

colourlovers: Provides access to the COLOURlovers API, which offers color inspiration and color palettes.

earthtones: Derive a color palette from a satellite image of a location on Earth. The vignette includes examples.

Homeric: Doughnut plots – pie charts with a blank center. The package is named after Homer Simpson.

RSVGTipsDevice (updated): An R SVG graphics device with support for dynamic tips and hyperlinks.

tweenr (updated): Create smooth animations by interpolating data.

 

Interfaces

haven (updated): Import and export SPSS, Stata and SAS Files. Details here.

readr (updated): Read flat/tabular text files from disk (or a connection). Overview here.

SnakeCharmR: Run Python code, make function calls, and assign and retrieve variables from R.

sofa: Provides an interface to the NoSQL database CouchDB.

sparklyr: Provision, connect and interface to Apache Spark from within R. Details at spark.rstudio.com.

XR: Support for interfaces from R to other languages, described in the book “Extending R” by John Chambers.

 

Programming Tools

checkpoint (updated): Easily use R packages from a specific date for reproducibility (see the vignette). This update adds the ability to delete snapshots and use a local snapshot repository.  

curl (updated): A modern and flexible web client for R. Vignette included.

curry: Partial function application with %<%, %-<%, and %><%.

jsonlite (updated): fast JSON parser and generator optimized for statistical data and the web.

proto (updated): An object oriented system using object-based rather than class-based object oriented ideas, from Hadley Wickham. Includes a vignette.

Rfast: Fast covariance matrix calculation, Mahalanobis distance and column-wise variances and other fast reimplementations of R functions.

sticky: Allows objects to be marked as 'sticky' and have resilient attributes that persist during many operations.

types: Simple type annotation for R, as a convention to allow other packages to provide error checking, automatic documentation or optimizations.

 

Reporting

bookdown: Output formats and utilities for authoring books with R Markdown.

htmltidy: Clean up gnarly HTML generated by the likes of Google Docs and Microsoft Word.

rmarkdown (updated): Dynamic documents for R. Overview and examples here.

tables (updated): Compute and display complex tables of summary statistics. Examples in the vignette.

tint: Format RMarkdown documents in the style of an Edward Tufte book. Examples here.

 

Statistical Modeling and Analysis

arc: Classification-based on Association Rules (CBA) algorithm for association rule classification (ARC).

brms: Bayesian regression models using Stan. Vignette included.

CADStat: Provides a GUI to several statistical methods including generalized linear regression, conditional probability calculations, and regression trees. The manual provides more details.

endogenous: Classical Simultaneous Equation Models.

ensembleR: Functions to use ensembles of several machine learning models specified in caret package.

MSGARCH: Markov-Switching GARCH Models.

oddsratio: Odds Ratio Calculation for Generalized Additive Models and Generalized Linear Models.

pkmon: Least-squares estimators under k-monotony constraint. Pikachu not included.

radiant: Business analytics using R and Shiny. Overview here.

simputation: Missing value imputation methods that work with the magrittr %>% pipe operator. Vignette included.

spikes: Detect election fraud from irregularities in vote-share distributions.

SurvDisc: Functions for discrete time survival analysis and longitudinal analysis, from the FDA.

 

Utilities

convertr: Convert between a broad range of scientific, historical, and industrial unit types.

digitize: Extract data from images of printed charts.

measurements: Convert between metric and imperial measurement units.

qrencoder: Create scannable QR codes and matrix barcodes.

optimr: A replacement and extension of the 'optim' function for smooth, possibly box constrained functions of several or many parameters.

RDocumentation: Display help files as they appear on RDocumentation.org

recordr: Record data provenance about R script executions (files read/written, packages used, timestamps etc.)

triangulation: Determine the location of an observer based on the angle towards three known points.

 

Other

FuzzyR: Design and simulate fuzzy logic systems using Type 1 Fuzzy Logic.

statquotes: Random quotations on topics in statistics, data visualization and science.

wfindr: Crossword, Scrabble and anagram solver.

 

You can find more notable packages from prior releases of MRO at the Spotlights page on MRAN.

 

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)