June 2025 Top 40 New CRAN Packages
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
In June, one hundred twenty-three new packages made it to CRAN. Here are my Top 40 picks in twenty-one categories: AI, Chess, Computational Methods, Data, Decision Analysis, Ecology, Epidemiology, Finance, Genomics, Linguistics, Machine Learning, Mathematics, Medical Statistics, Music Theory, Networks, Programming, Statistics, Time Series, Utilities, and Visualization.
AI
statlingua v0.1.0: Provides functions to transform complex statistical output into straightforward, understandable, and context-aware natural language descriptions using Large Language Models (LLMs). It works with OpenAI, Google AI Studio, and Anthropic. See the vignette for examples.
vitals v0.1.0: Provides a port of Inspect
, a widely adopted Python
framework for large language model evaluation that supports prompt engineering, tool usage, multi-turn dialog, and model graded evaluations. It is specifically aimed at ellmer
users who want to measure the effectiveness of their LLM based products. There are two vignettes: Getting Started and Writing Evals.
Chess
chess2plyrs v0.3.0: Implements a chess program based on the Minimax chess engine which allows users to create games, read and write FEN and more. See the vignette for examples.
Computational Methods
tvdenoising v1.0.0: Implements total variation denoising which can be used to approximate a given sequence of noisy observations by a piecewise constant sequence with adaptively-chosen break points. See Johnson (2013) for background and README for an example.
wideRhino v1.0.2: Provides functions to construct a canonical Variate Analysis biplot via the Generalized Singular Value Decomposition for cases when the number of samples is less than the number of variables. For more information on biplots, see Gower et al. (2011), for a discussion of the generalized singular value decomposition, see Edelman & Wang (2020), and see README for examples.
Data
avilistr v0.0.1: Provides easy access to the AviList Global Avian Checklist, the first unified global bird taxonomy that harmonizes differences between International Ornithological Committee, Clements, and BirdLife checklists. See the vignette to get started.
ecoteach v0.1.0: A collection of curated educational datasets for teaching ecology and agriculture concepts. Includes data on wildlife monitoring, plant treatments, and ecological observations with documentation and examples for educational use. All datasets are derived from published scientific studies. See the vignette.
jpinfect v0.1.2: Provides functions to download and post process the infectious disease case data from the Japan Institute for Health Security. See the vignette to get started.
LBDiscover v0.1.0: Provides a suite of tools for literature-based discovery in biomedical research, including functions for retrieving scientific articles from PubMed and other NCBI databases, extracting biomedical entities (diseases, drugs, genes, etc.), building co-occurrence networks, and applying various discovery models. See README to get started.
Rdatasets v0.0.1: Provides functions to search, download, and view documentation for thousands of datasets from R packages included in the Rdatasets archive. Datasets are available in both CSV and Parquet formats. See README to get started.
Decision Analysis
RMCDA v0.3: Implements methods including AHP, TOPSIS, PROMETHEE, VIKOR, Stratified MCDM, and the Stratified Best–Worst Method to support multiple criteria decision making. Najafi & Mirzaei (2025) contains references for all of these acronyms and provides details of the package. See the vignette for examples.
Ecology
climodr v1.0.0: Provides tools to automate workflows for predictive climate mapping using climate station data, and tools to create reproducible climate models. See Meyer (2019) and Meyer (2022) for background on the methods implemented and the vignette and website for details.
movedesign v0.3.1: Implements a toolbox and shiny
application to help researchers design movement ecology studies, focusing on two key objectives: estimating home range areas, and estimating fine-scale movement behavior, specifically speed and distance traveled. See Silva et al. (2023) for background. There are three vignettes, including Get started and Tutorial.
Epidemiology
infectiousR v0.1.0: Provides functions to access real-time infectious disease data from the disease.sh API, including global COVID-19 data, vaccination coverage, influenza-like illness data from CDC, and more. It also includes curated datasets on a variety of infectious diseases. See the vignette to get started.
rifttable v0.7.1: Provides functions to automate the production of reproducible, presentation-ready tables for epidemiologists. Users specify the design of the table, with rows and/or columns defined by exposures, effect modifiers, and estimands. See Rothman (2017) for background. There are six vignettes, including Get started and Survival outcomes.
Finance
fEGarch v1.01: Provides functions to implement and fit a variety of short-memory and long-memory models from a very broad family of exponential generalized autoregressive conditional heteroskedasticity EGARCH models. Includes MEGARCH, modified EGARCH, FIEGARCH, fractionally integrated EGARCH, and FIMLog-GARCH, fractionally integrated modulus Log-GARCH. Look here for background and see README for examples.
Genomics
multiDEGGs 1.0.0: Provides functions to perform multi-omic differential network analysis by revealing differential interactions between molecular entities (genes, proteins, transcription factors, or other biomolecules) across the omic datasets provided. A comprehensive visualization from differential networks is constructed for each omic dataset, where links represent statistically significant differential interactions between entities. See Sciacca et al. (2023) for information on the methods implemented, and the vignette for examples.
rsynthbio v2.0.0: Implements a wrapper to the Synthesize Bio API, which enables users to generate realistic gene expression data based on specified biological conditions. Researchers can access AI-generated transcriptomic data for various modalities, including bulk RNA-seq, single-cell RNA-seq, and microarray data. See the vignette to get started.
Linguistics
tidynorm v0.3.0: Implements tidy speaker vowel normalization and includes generic functions for defining new normalization methods for points, format tracks, and Discrete Cosine Transform coefficients, as well as convenience functions implementing established normalization methods. See Johnson (2020), Lobanov (1971), and Watt & Fabricius (2002) for the theory. There are five vignettes, including Normalization Methods and Normalization Overview.
Machine Learning
midr v0.5.0: Implements Maximum Interpretation Decomposition, a functional decomposition technique, to provide a model-agnostic method for interpreting and explaining black-box predictive models by creating a globally interpretable surrogate model. See Asashiba et al. (2025) for details and README for examples.
Mathematics
polarzonoid v0.1-2: Implements some applications of the polar zonoid, a straightforward generalization of the polar zonohedron in dimension 3 and a root solver for trigonometric polynomials. There are five vignettes, including a User Guide and Real Projective Spaces and 3×3 Rotation Matrices.
Medical Statistics
bbssr v1.0.2: Provides comprehensive tools for blinded sample size re-estimation in two-arm clinical trials with binary endpoints that allow for adaptive sample size adjustments during trials while maintaining statistical integrity and study blinding. Implements five exact statistical tests: Pearson chi-squared, Fisher exact, Fisher mid-p, Z-pooled exact unconditional, and Boschloo exact unconditional tests. See Mehrotra et al. (2003) and Kieser (2020). There are three vignettes, including an Introduction and Statistical Methods.
causens v0.0.3: Implements methods to perform causal sensitivity analysis to adjust for possible unmeasured confounders when working with observational data. Methods include those developed in Brumback et al. (2004), Li et al. (2011), the Bayesian and Monte Carlo approaches of McCandless et al. (2017). See the vignette for examples.
door v0.0.2: Provides functions for the design, analysis, and interpretation of clinical trials and other research studies based on the patient centric benefit risk evaluation. See Hamasaki & Evans (2025) for details of the statistical methods and the Shiny App for documentation and examples.
Music Theory
musicMCT v0.2.0: Provides functions to analyze musical scales à la Modal Color Theory of Sherrill (2025), work with conventional music pitch theory and the continuous geometries of Callender et al. (2008), and identify structural properties of scales and calculate derived values. There is an Introduction and a vignette on Visualizing Higher Dimensions.
Networks
INetTool v0.1.1: Implements methods to model complex systems as a consensus network where nodes can represent either statistical units or observed variables and edges represent distance metrics or correlation between units. See Policastro et al. (2024) for a description of the method and the vignette for an example.
Programming
putior v0.1.0: Provides tools for extracting and processing structured annotations from R
and Python
source files to facilitate workflow visualization. Functions scan source files for annotations that define nodes, connections, and metadata within a data processing workflow, which are used to generate visual representations of data flows and processing steps across polyglot software environments. See Knuth (1984) for background and the vignette and README for more information.
quickr v0.1.0: Provides compiled R
functions annotated with type and shape declarations for fast performance with robust runtime type checking. Supports both just-in-time (JIT) and ahead-of-time (AOT) compilation. Compilation is performed by lowering R
code to FORTRAN
. See README for examples.
Statistics
aamatch v0.3.7: Implements a simple version of multivariate matching using a propensity score, near-exact matching, near-fine balance, and robust Mahalanobis distance matching. You specify the variables, and the program does everything else. See Rosenbaum (2020) for details.
bayesmsm v1.0.0: Implements Bayesian marginal structural models for causal effect estimation with time-varying treatment and confounding and includes an extension for informative right censoring. See Saarela (2015) for methodological details. There are two vignettes containing examples with right censoring and without right censoring.
BCD v0.1.1: Implements bivariate binomial, geometric, and Poisson distributions based on conditional specifications and includes tools for data generation and goodness-of-fit testing for these three distribution families. For methodological details, see Ghosh et al.(2025), Ghosh et al. (2023), and Ghosh et al. (202). There are three vignettes: Bivariate Binomial Conditionals, Bivariate Geometric Conditionals, and Bivariate Poisson Conditionals.
lognGPD v0.1.0: Provides functions to estimate a lognormal, generalized Pareto mixture model via the Expectation-Maximization algorithm and includes functions for random number simulation and density evaluation. For details, see Bee & Santi (2025).
QuantilePeer v0.0.1: Provides functions to simulate and estimate peer effect models, including the quantile-based specification (Houndetoungan (2025)), and the models with Constant Elasticity of Substitution (CES)-based social norm (Boucher et al. (2024)). See the vignette for details.
riskdiff v0.2.1: Provides functions to calculate risk differences (or prevalence differences for cross-sectional data) using generalized linear models with automatic link function selection. See Austin (2011) for background on propensity score matching and Donoghoe & Marschner (2018) for package details. There are two vignettes: Getting Started, Causal Inference.
survextrap v1.0: Provides functions for survival analysis using Bayesian models for individual-level right-censored data. Hazard functions are modeled with M-splines. Priors can be customized and calibrated to substantive beliefs. Posterior distributions are estimated using Stan
. See Jackson (2023) for details. There are three vignettes, including Examples and Priors.
unsum v0.2.0: Reconstructs all possible raw data that could have led to reported summary statistics. Provides a wrapper for the Rust
implementation of the CLOSURE
algorithm. See the vignette.
Time Series
gseries v3.0.2: Provides functions to improve the coherence of time series data. The methods used are described in Dagum & Cholette (2006). There are four vignettes, including Benchmarking Cookbook and A Beginners Benchmarking Script.
Utilities
blocking v1.0.1: Provides blocking methods for record linkage and deduplication using approximate nearest neighbor algorithms, functions to generate shingles from character strings and similarity vectors for record comparison, and evaluation metrics for assessing blocking performance, including false positive and false negative rates. For background and details, see Papadakis et al. (2020), Steorts et al. (2014), Dasylva and Goussanou (2021), and Dasylva and Goussanou (2022). There are three vignettes, including blocking records for deduplication and record linkage.
flir v0.5.0: Provides functions to find and fix lints (code patterns that are not optimal because they are inefficient) in R code. There are three vignettes including Adding new rules and Automatic fixes.
Visualization
fractalforest v1.0.1: Provides functions to create and visualize fractal trees and fractal forests, based on the Lindenmayer system (L-system). For more details, see Lindenmayer (1968a) and Lindenmayer (1968b). There is an Introduction and a vignette on customizing fractal trees.
ggtime v0.1.0: Extends ggplot2
by implementing a grammar of temporal graphics and helper functions for visualizing temporal patterns in time series graphics, time plots, season plots, and seasonal sub-series plots. See README for examples.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.