October 2025 Top 40 New CRAN Packages

Joseph Rickert

23 hours ago

[This article was first published on R Works, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

< section id="data" class="level3">

Data

AlgeriAPIs v0.1.0: Provides functions to access data from public RESTful APIs, including World Bank API and REST Countries API, retrieving real-time or historical information related to Algeria. The package enables users to query economic indicators and international demographic and geopolitical statistics in a reproducible way. See the vignette.

CopernicusClimate v0.0.3: Provides functions to subset and download data from EU Copernicus Climate Data Service, including information about the Earth’s past, present, and future climate. See the vignettes Downloading from Copernicus and Translate API Code.

FakeDataR v0.2.2: Provides functions to generate privacy-preserving synthetic datasets that mirror structure, types, factor levels, and missingness; export bundles for LLM workflows and build fake data directly from SQL database tables without reading real rows. See Nowok et al. (2016) for background methods and Bommasani et al. (2021) for an overview of the foundation model. There are three vignettes, including Getting started and Privacy and validation.

faunabr v1.0.0: Provides functions to retrieve, filter, and spatialize data from the Catálogo Taxônomico da Faunado Brasil. There are eight vignettes, including Getting Started and Flag Erroneous Records.

ForCausality v0.1.0: Provides a comprehensive set of datasets and tools for causal inference research that includes data from clinical trials, cancer studies, epidemiological surveys, environmental exposures, and health-related observational studies. The package is inspired by the foundational work of Pearl (2009). See the vignette.

healthmotionR v0.2.0: Provides a broad collection of datasets focused on health, biomechanics, and human motion, including clinical, physiological, and kinematic information from diverse sources, covering aspects such as surgery outcomes, vital signs, rheumatoid arthritis, osteoarthritis, accelerometry, gait analysis, motion sensing, and biomechanics experiments. See the vignette.

imfapi v0.1.2: Provides user-friendly functions for programmatic access to macroeconomic data from the International Monetary Fund’s SDMX 3.0 IMF Data API. See README to get started.

scf v1.0.5: Provides functions to analyze public use microdata from the Survey of Consumer Finances, including tools to download prepared data files, construct replicate-weighted multiply imputed survey designs, compute descriptive statistics and model estimates, and produce plots and tables. See the vignette.

< section id="decision-analysis" class="level3">

Decision Analysis

andorR v0.3.1: Implements a decision support tool to strategically prioritize evidence gathering in complex, hierarchical AND-OR decision trees. It is designed for situations with incomplete or uncertain information where the goal is to reach a confident conclusion as efficiently as possible (responding to the minimum number of questions, and only spending resources on generating improved evidence when it is of significant value to the final decision). There are five vignettes, including Introduction and Example Data Files.

< section id="ecology" class="level3">

Ecology

greenSD v0.1.1: Access and analyze multi-band greenspace seasonality data cubes (available for 1,028 major global cities), global Normalized Difference Vegetation Index / land cover data from the European Space Agency WorldCover 10m Dataset, and Sentinel-2-l2a images. The package also supports calculating human exposure to greenspace using a population-weighted greenspace exposure model introduced by Chen et al. (2022) based on Global Human Settlement Layer population data. See the vignette to get data, and look here for additional information.

paisaje v0.1.1: Provides functions for landscape analysis and data retrieval, which allow users to download environmental variables from global datasets (e.g., WorldClim, ESA WorldCover, Nighttime Lights), and to compute spatial and landscape metrics using a hexagonal grid system based on the H3 spatial index. See Fick and Hijmans (2017) and Zanaga et al. (2022) for background and the vignette for examples.

< section id="econometrics" class="level3">

Econometrics

BayesianDisaggregation v0.1.2: Implements a novel Bayesian disaggregation framework that combines Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) dimension reduction of prior weight matrices with deterministic Bayesian updating rules. The method provides Markov Chain Monte Carlo (MCMC) free posterior estimation with built-in diagnostic metrics. Read the vignette in English or Spanish.

pvars v1.1.1: Implements panel cointegration rank tests and estimators for panel vector autoregressive models, and identification methods for panel structural vector autoregressive models. Functions allow accounting for cross-sectional dependence and for structural breaks in the deterministic terms of the VAR processes. Particularly noteworthy are the correlation-augmented inverse normal test on the cointegration rank by Arsova and Oersal (2021), the two-step estimator for pooled cointegrating vectors by Breitung (2005), and the pooled identification based on independent component analysis by Herwartz and Wang (2024). See the vignette for a detailed introduction to the package and underlying theory.

< section id="finance" class="level3">

Finance

amsSim v0.1.0: Implements simulation and pricing routines for rare-event options using adaptive multilevel splitting and standard Monte Carlo under Black-Scholes and Heston models. Core routines are implemented in C++ via Rcpp and RcppArmadillo with lightweight R wrappers. Look here for the theory and see the README to get started.

< section id="genomics" class="level3">

Genomics

BTIME v1.0.0: Implements Bayesian Hierarchical beta-binomial models for modeling cell population to predictors/exposures. This package utilizes runjags to run Gibbs sampling with parallel chains. Options allow for different covariances/relationship structures among parameters of interest. There is an Introduction and a vignette on Covariance Structures.

< section id="logic" class="level3">

Logic

Pinference v0.2.5: Implements T. Hailperin’s procedure for calculating lower and upper probability bounds for a propositional-logic expression, given equality and inequality constraints on the probabilities for other expressions. Truth-valuation is included as a special case. Applications range from decision-making and probabilistic reasoning, to pedagogical for probability and logic courses. See Hailperin (1965) background on logic and the vignette for an analysis of the Monty Hall Problem and more.

< section id="machine-learning" class="level3">

Machine Learning

bigPCAcpp v0.9.0: Implements high performance principal component analysis routines that operate directly on bigmemory::big.matrix objects. Functions avoid materializing large matrices in memory by streaming data through BLAS and LAPACK kernels and include helpers to derive scores, loadings, correlations, and diagnostics, and include utilities to stream results into bigmemory matrices for file-based workflows. Also implemented is the Scalable principal component analysis of Elgamal et al. (2015). There is an Introduction and a vignette on Benchmarking.

FuzzySpec v1.0.0: Implements FVIBES, the Fuzzy Variable-Importance Based Eigenspace Separation algorithm. See the vignette.

roclab v0.1.4: Implements ROC (Receiver Operating Characteristic)–Optimizing Binary Classifiers, supporting both linear and kernel models. Scalability for large datasets is achieved through approximation-based options, which accelerate training and make fitting feasible on large data. Utilities are provided for model training, prediction, and cross-validation. See Hernàndez-Orallo et al. (2004) background and the vignette for examples.

rSDR v1.0.3.0: Implements a novel, sufficient dimension reduction method that is robust against outliers using alpha-distance covariance and manifold-learning. See Huang et al.(2024) for details and the vignette for examples.

< section id="mathematics" class="level3">

Mathematics

SimplicialComplex v0.1.0: Implements simplicial complexes for Topological Data Analysis (TDA) and includes functions to compute faces, boundary operators, Betti numbers, and Euler characteristics. It also provides tools for studying persistent homology with the aim of helping readers understand the core concepts of computational topology. Zomorodian and Carlsson (2005) and Chazal and Michel (2021) for background and look here to access the Shiny App playground, which allows exploring the concepts underlying TDA.

< section id="medical-statistics" class="level3">

Medical Statistics

PERSUADE v0.1.2: Provides a standardized framework to support the selection and evaluation of parametric survival models for time-to-event data. Includes tools for visualizing survival data, checking proportional hazards assumptions (Grambsch and Therneau (1994)), comparing parametric (Ishak et al. (2013)), spline (Royston and Parmar (2002)) and cure models, examining hazard functions, and evaluating model extrapolation. Methods are consistent with recommendations in the NICE Decision Support Unit Technical Support Documents 14 and 21. See README to get started.

shinymrp v0.9.1: Provides a dual interface, graphical and programmatic for multilevel regression and post stratification applications, offering tools for data cleaning, exploratory analysis, model building, and visualization. Users can apply the method to a variety of datasets including electronic health records and sample survey data. See Si (2025) for background. There are five vignettes, including Getting Started and Programmatic workflow demonstration.

< section id="statistics" class="level3">

Statistics

choicedata v0.1.0: Offers a set of objects tailored to simplify working with choice data. It enables the computation of choice probabilities and the likelihood of various types of choice models based on given data. Look here for a detailed introduction.

GPpenalty v1.0.0: Implements maximum likelihood estimation for Gaussian processes, supporting both isotropic and separable models with predictive capabilities. Includes penalized likelihood estimation following Li and Sudjianto (2005). Functions use decorrelated prediction error metrics to account for uncertainty, and cross validation techniques for tuning parameter selection. Designed specifically for small datasets. See README for an example.

mda.biber v1.0.1: Implements the factor analysis developed in Biber (1992) most commonly used to describe language as it varies by genre, register, and use. Functions describe and plot MDA results, including dimension scores, dimension means, and factor loadings. See the vignette for an introduction.

PanelSelect v1.0.0: Extends the Heckman selection framework to panel data with individual random effects. The first stage models participation via a panel Probit specification, while the second stage can take a panel linear, Probit, Poisson, or Poisson log-normal form. Model details are provided in Bailey and Peng (2025) and Peng and Van den Bulte (2024). See the vignette for an introduction.

partialling.out v0.2.0: Creates a data frame with the residuals of partial regressions of the main explanatory variable and other variables of interest. This method follows the Frisch-Waugh-Lovell theorem, as explained in Lovell (2008). See the vignette.

projoint v1.0.6: Provides tools for analyzing data generated from conjoint survey experiments, including functions to estimate marginal means and average marginal component effects, with corrections for measurement error and methods for profile-level and choice-level estimators, bias correction using intra-respondent reliability, and visualization utilities. For details on the methodology, see Clayton et al. (2025). There are seven vignettes including Analyze and Visualize Important QOIs and Explore and Compare Further.

RegCalReliab v0.2.0: Implements regression calibration methods for correcting measurement error in regression models using external or internal reliability studies. Methods are described in Carroll et al. (2006). See the vignette.

RTMBdist v0.1.0: Extends the functionality of the RTMB package by providing a collection of non-standard probability distributions compatible with automatic differentiation. Automatic differentiation and Laplace approximation are described in Kristensen et al. (2016). See the vignettes, Examples and distlist.

< section id="time-series" class="level3">

Time Series

conformalForecast v0.1.0: Provides methods and tools for performing multistep-ahead time series forecasting using conformal prediction methods, including classical conformal prediction, adaptive conformal prediction, conformal PID (Proportional-Integral-Derivative) control, and autocorrelated multistep-ahead conformal prediction. The methods were described by Wang and Hyndman (2024). See the vignette for examples.

funbootband v0.2.0: Provides methods to compute simultaneous prediction and confidence bands for dense time series data. The implementation builds on the functional bootstrap approach proposed by Lenhoff et al. (1999) and extended by Koska et al. (2023) to support both independent and clustered (hierarchical) data. See the vignette.

kardl v0.1.1: Implements estimation procedures for Autoregressive Distributed Lag (ARDL) and Nonlinear ARDL (NARDL) models, which allow researchers to investigate both short and long-run relationships in time series data under mixed orders of integration. The package includes several cointegration testing approaches, such as the Pesaran et al. (2001) F and t bounds tests, the Banerjee error correction test, and the restricted ECM test, together with diagnostic tools, including Wald tests for asymmetry, ARCH tests, and stability procedures. See README to get started.

< section id="utilities" class="level3">

Utilities

bakerrr v0.2.0: Provides functions to launch, track, and control background-parallel jobs and includes utilities for job status, error handling, resource monitoring, and result collection. Designed for scalable workflows in interactive and automated settings (local or remote). Look here for more information. There are four vignettes, including Logging to File and Orchestrating Multiple Functions in Parallel and in Background.

localLLM v1.0.1: Provides R bindings to the llama.cpp library for running large language models. The package uses a lightweight architecture where the C++ backend library is downloaded at runtime rather than bundled with the package. Package features include text generation, reproducible generation, and parallel inference. Look here to get started.

rixpress v0.10.1: Provides functions to streamline the creation of reproducible analytical pipelines using default.nix expressions generated via the rix package for reproducibility. Define derivations in R, Python or Julia, chain them into a composition of pure functions, and build the resulting pipeline using Nix as the underlying end-to-end build tool. Functions to plot the pipeline as a directed acyclic graph are included, as well as functions to load and inspect intermediary results for interactive analysis. There are twelve vignettes, including introductory concepts and core functions.

summarytabl v0.2.1: Provides functions to tabulate and summarize categorical, multiple response, ordinal, and continuous variables in R data frames, making it easy to create clear, structured summary tables. See the vignette.

tabler v0.1.0: Provides functions to build interactive dashboards combining the Tabler UI Kit with Shiny, making it easy to create professional-looking web applications. Dashboards are fully responsive and compatible with all modern browsers. Offers customizable layouts and components built with HTML5 and CSS3. See README to get started.

< section id="visualization" class="level3">

Visualization

graphonmix v0.0.1.0: Generates (U,W) mixture graphs where U is a line graph graphon and W is a dense graphon. Graphons are graph limits and graphon U can be written as a sequence of positive numbers adding to 1. Graphs are sampled from U and W and joined randomly to obtain the mixture graph. Given a mixture graph, U can be inferred. See Kandanaarachchi and Ong (2025) for background and the vignettes Introduction and Sparse graphs from line graphons.

SimpleUpset v0.1.3: Provides functions to create Upset plots using a combination of ggplot2 and patchwork. See Lex et al. (2014) for background and the vignette for examples.

To leave a comment for the author, please follow the link and comment on their blog: R Works.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.