R-bloggers

Likert Scale Questions: Your In-Depth Guide

Unknown — Thu, 18 Jun 2026 09:46:51 +0000

[This article was first published on RStudioDataLab, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

A Likert scale (pronounced LICK-ert, not “LIKE-ert”) is a psychometric rating scale used in surveys and questionnaires to measure attitudes, opinions, and perceptions. Named after American social psychologist Rensis Likert, who developed it in 1932, it remains the most widely used approach to scaling responses in survey research today.

Key Takeaways

Definition: A Likert scale measures how strongly people agree or disagree with a statement, typically using 5 or 7 ordered response options.
Structure: Each item presents a statement followed by response options from “Strongly Disagree” to “Strongly Agree.”
Purpose: Turns subjective opinions into quantitative data for statistical analysis.
Formats: 4-point, 5-point, 6-point, and 7-point scales each serve different research needs.
Analysis: Use median and frequency tables for single items; use mean and Cronbach’s alpha for multi-item scales.

Likert Scale vs. Likert Item: An Important Distinction

These two terms are often used interchangeably — that is technically incorrect and matters for your analysis.

A Likert item is a single statement with a rated response (e.g., “I am satisfied with the service: Strongly Disagree → Strongly Agree”).
A Likert scale is the sum or average of several related Likert items designed to measure a single construct.

Treating a single item as a complete “scale” is one of the most common errors in survey design. If you have only one question, you have a Likert item, not a Likert scale — and the appropriate statistical treatment differs.

Types of Likert Scale Response Options

Likert scales are not limited to measuring agreement. Depending on your research objective, you can measure frequency, importance, quality, or likelihood using the same format. The table below shows the most common response option sets:

Dimension	Option 1	Option 2	Option 3	Option 4	Option 5
Agreement	Strongly Disagree	Disagree	Neither	Agree	Strongly Agree
Frequency	Never	Rarely	Sometimes	Often	Always
Importance	Not Important	Slightly Important	Moderately Important	Important	Very Important
Quality	Very Poor	Poor	Fair	Good	Excellent
Likelihood	Definitely Not	Probably Not	Possibly	Probably	Definitely
Satisfaction	Very Dissatisfied	Dissatisfied	Neutral	Satisfied	Very Satisfied
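As a minimal sketch of how one of these response sets might be stored for analysis in R, the Agreement labels from the table can be held in an ordered factor. The `responses` vector below is made-up illustration data, not survey results.

```r
# Labels for the Agreement dimension, in scale order (from the table above)
agreement_levels <- c("Strongly Disagree", "Disagree", "Neither",
                      "Agree", "Strongly Agree")

# Hypothetical answers from five respondents to a single item
responses <- c("Agree", "Neither", "Strongly Agree", "Agree", "Disagree")

# An ordered factor preserves the ranking of the response options
item <- factor(responses, levels = agreement_levels, ordered = TRUE)

# Integer codes 1-5, useful when items are later summed or averaged
codes <- as.integer(item)

print(table(item))  # frequencies in scale order, including unused options
print(codes)        # 4 3 5 4 2
```

Keeping the labels as an ordered factor means frequency tables and plots appear in scale order rather than alphabetical order.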

Likert Scale Formats: 4, 5, 6, and 7 Points Compared

Choosing the right number of response points affects the precision of your data and the cognitive load on respondents. Here is how each format differs in practice:

Format	Neutral Point?	Best For	Main Trade-off
4-Point	No (forced choice)	When you need a clear directional opinion	Can frustrate genuinely neutral respondents
5-Point	Yes	Most general research; most familiar to respondents	Central tendency bias is common
6-Point	No (forced choice)	When you want fine-grained data without a fence-sitter option	Less intuitive labeling
7-Point	Yes	Academic research requiring maximum discrimination	Harder to label all points meaningfully
10-Point	No true midpoint (even number of points)	NPS-style scoring; familiarity from school grades	Data often clusters; not true Likert by strict definition

5-Point Likert Scale (Most Common)

The 5-point scale is the default choice in most survey research because it balances nuance with simplicity. Example:

“The quality of food at XYZ Restaurant is excellent.”

Strongly Disagree
Disagree
Neither Agree nor Disagree
Agree
Strongly Agree

4-Point Likert Scale (Forced Choice)

Removing the neutral midpoint forces respondents to take a position. Use this when fence-sitting would undermine your research objective — for example, when measuring purchase intent or policy support where “no opinion” is not useful data.

Strongly Disagree
Disagree
Agree
Strongly Agree

6-Point Likert Scale

Like the 4-point, this eliminates the neutral option while providing more granularity. Useful in employee satisfaction or consumer preference research where a clearer lean is needed.

Strongly Disagree
Disagree
Slightly Disagree
Slightly Agree
Agree
Strongly Agree

7-Point Likert Scale

The 7-point scale is preferred in academic and psychological research where capturing subtle differences in attitude matters. It improves statistical reliability but requires more careful labeling.

Strongly Disagree
Moderately Disagree
Slightly Disagree
Neither Agree nor Disagree
Slightly Agree
Moderately Agree
Strongly Agree

When to Use a Likert Scale

A Likert scale is the right tool when you need to measure characteristics that have no objective measurement — attitudes, opinions, satisfaction levels, or perceived likelihood. It is not appropriate when:

A simple yes/no question would fully answer your research question
You are measuring factual behaviors (e.g., “How many times per week do you exercise?” — use a numerical input instead)
Respondents lack sufficient knowledge of the topic to have a genuine opinion

Use a Likert scale when you need to distinguish between degrees of agreement, not just direction. The difference between “Agree” and “Strongly Agree” often carries meaningful information in customer satisfaction and employee engagement research.

How to Design an Effective Likert Scale

Write Clear, Single-Focus Statements

Each item must address exactly one idea. A statement like “The service was fast and the staff were friendly” is a double-barreled item — the respondent may agree with one half and disagree with the other, making their response uninterpretable.

Balance Your Scale

A well-designed Likert scale includes an equal number of positively and negatively worded items. This counteracts acquiescence bias — the tendency of some respondents to agree with statements regardless of content. If all your items are positive, respondents who habitually agree will appear more satisfied than they actually are.
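When a scale mixes positively and negatively worded items, the negative items are usually reverse-scored before items are combined, so that a high number always means the same thing. A minimal sketch in base R follows; the helper name `reverse_score` and the example values are assumptions for illustration.

```r
# Reverse-score a negatively worded item on a min_point..max_point scale.
# On a 1-5 scale: 1 becomes 5, 2 becomes 4, and so on.
reverse_score <- function(x, min_point = 1, max_point = 5) {
  (max_point + min_point) - x
}

# Hypothetical responses to a negatively worded item
negative_item <- c(1, 2, 3, 4, 5)

reversed <- reverse_score(negative_item)
print(reversed)  # 5 4 3 2 1
```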

Avoid Leading Language

Avoid adverbs like “very,” “extremely,” or “always” inside the item statement itself. “This website is extremely fast” will yield fewer “Strongly Agree” responses than “This website is fast,” not because respondents think differently but because the bar is higher.

Keep the Scale Consistent Throughout the Survey

Switching between a 5-point and 7-point scale in the same questionnaire forces respondents to mentally reset and increases error rates. Choose one format and use it throughout.

Likert Scale Response Bias: What Can Distort Your Data

Understanding bias is not optional for anyone analyzing Likert data — it directly affects whether your conclusions are valid.

Bias Type	What Happens	How to Reduce It
Acquiescence bias	Respondents agree with statements regardless of content	Include negatively worded items; balance scale direction
Central tendency bias	Respondents cluster around the midpoint, avoiding extremes	Use an even-point scale to remove the neutral option when appropriate
Social desirability bias	Respondents choose the answer they think is most socially acceptable	Ensure anonymity; frame items neutrally
Extreme response bias	Some respondents always select the most extreme option	Use more scale points (7-point) to better distinguish genuine extremes

How to Analyze Likert Scale Data

Single Item vs. Multi-Item Scale: Different Rules Apply

This is the most commonly misunderstood part of Likert analysis. A single Likert item produces ordinal data — the intervals between response options are not guaranteed to be equal. Calculating a mean on ordinal data is statistically questionable. For a single item, use:

Median as your measure of central tendency
Frequency tables and percentages for distribution
Chi-square tests or Mann-Whitney U for group comparisons

A full Likert scale (summed or averaged across multiple items) behaves more like interval data, especially with 5+ items and a reasonable sample size. In this case, parametric statistics become more defensible:

Mean and standard deviation for descriptive summaries
Cronbach’s alpha (α) to test internal consistency — aim for α > 0.7
t-tests or ANOVA for group comparisons
Pearson correlation for relationships between Likert scores and other variables (Spearman’s rank correlation remains the safer choice when scores are clearly non-normal)
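The two sets of rules above can be sketched in base R as follows. The data frame `survey`, its column names, and its values are invented for illustration; `wilcox.test()` is base R's implementation of the Mann-Whitney U test.

```r
# Hypothetical 5-point responses from two groups of five respondents
survey <- data.frame(
  group = rep(c("A", "B"), each = 5),
  q1 = c(4, 5, 3, 4, 5, 2, 3, 2, 4, 3),
  q2 = c(5, 4, 4, 4, 5, 3, 3, 2, 3, 3),
  q3 = c(4, 4, 3, 5, 5, 2, 4, 3, 3, 2)
)

# --- Single item (ordinal): median, frequencies, nonparametric test ---
q1_median <- median(survey$q1)
q1_freq   <- prop.table(table(survey$q1))
q1_test   <- wilcox.test(q1 ~ group, data = survey, exact = FALSE)

# --- Multi-item scale: average the items, then use parametric tools ---
survey$scale_score <- rowMeans(survey[, c("q1", "q2", "q3")])
group_means <- aggregate(scale_score ~ group, data = survey, FUN = mean)
scale_test  <- t.test(scale_score ~ group, data = survey)

print(q1_median)
print(q1_freq)
print(group_means)
```

With only ten rows this is purely a demonstration of the calls; real conclusions need a realistic sample size.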

Cronbach’s Alpha: Checking if Your Scale Holds Together

If you are using multiple Likert items to measure the same construct, run Cronbach’s alpha before reporting results. An alpha above 0.8 indicates strong internal consistency. Values between 0.7 and 0.8 are acceptable. Below 0.7 suggests your items are not measuring the same thing — revise or remove items with low item-total correlations.
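For readers who want to see what the statistic does, here is a minimal base R sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of the total score), together with corrected item-total correlations. The function names and the `items` data are assumptions for illustration; dedicated packages offer more complete output.

```r
# Cronbach's alpha from a respondents-by-items table of numeric scores
cronbach_alpha <- function(items) {
  items <- as.matrix(items)
  k <- ncol(items)                       # number of items
  item_vars <- apply(items, 2, var)      # variance of each item
  total_var <- var(rowSums(items))       # variance of the summed score
  (k / (k - 1)) * (1 - sum(item_vars) / total_var)
}

# Corrected item-total correlation: each item vs. the sum of the others
item_total_cor <- function(items) {
  items <- as.matrix(items)
  sapply(seq_len(ncol(items)), function(i) {
    cor(items[, i], rowSums(items[, -i, drop = FALSE]))
  })
}

# Hypothetical responses: 6 respondents, 3 items on a 5-point scale
items <- data.frame(
  q1 = c(4, 5, 3, 2, 4, 1),
  q2 = c(5, 4, 3, 2, 4, 2),
  q3 = c(4, 4, 2, 3, 5, 1)
)

alpha_value <- cronbach_alpha(items)
print(alpha_value)
print(item_total_cor(items))
```

Items with a low corrected item-total correlation are the first candidates to revise or drop.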

Likert Scale Examples Across Research Domains

Customer satisfaction survey:

“How satisfied are you with the cleanliness of our facilities?”

Very Dissatisfied
Dissatisfied
Neither Satisfied nor Dissatisfied
Satisfied
Very Satisfied

Employee engagement survey:

“To what extent do you agree: ‘The new company policy enhances employee productivity’?”

Strongly Disagree
Disagree
Neither Agree nor Disagree
Agree
Strongly Agree

Online UX research:

“Rate your agreement: ‘The online shopping experience was user-friendly and intuitive.’”

Strongly Disagree
Disagree
Neither Agree nor Disagree
Agree
Strongly Agree

Advantages and Disadvantages of Likert Scales

Advantages	Disadvantages
Easy for respondents to understand and complete	Prone to acquiescence and social desirability bias
Produces quantitative data from subjective opinions	Ordinal data is not strictly interval — mean can be misleading
Flexible: measures agreement, frequency, satisfaction, likelihood	Central tendency bias reduces discrimination
Widely understood — high response rates	A single item cannot represent a full scale
Supports statistical analysis across groups	Does not capture why a respondent chose a particular point

Conclusion

A Likert scale is one of the most versatile and reliable tools in survey research — when used correctly. The key decisions are choosing the right number of response points for your research goal, writing items that are balanced and unambiguous, and applying the correct statistical method depending on whether you are working with a single item or a multi-item scale. Whether you are measuring customer satisfaction, employee engagement, student attitudes, or any other opinion-based construct, the principles remain the same: clarity in item wording, consistency in format, and honesty about what ordinal data can and cannot tell you.

Frequently Asked Questions

Q: What is the difference between a Likert scale and a Likert item?

A: A Likert item is a single rated statement. A Likert scale is the aggregate of multiple related items. The distinction matters for analysis: single items should use median and nonparametric tests; full scales can use mean and parametric tests.

Q: How do you pronounce Likert?

A: The correct pronunciation is “LICK-ert,” not “LIKE-ert.” It is named after Rensis Likert, who created the scale in 1932.

Q: Should I use a 5-point or 7-point Likert scale?

A: For general surveys and applied research, a 5-point scale is easier for respondents and produces reliable results. For academic or psychological research where detecting subtle attitude differences matters, a 7-point scale offers better statistical discrimination. Research comparing 5-point and 7-point scales finds that both produce similar mean scores once rescaled — so the choice depends more on respondent context than statistical superiority.
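One common way to put the two formats on the same footing is a linear rescaling that maps the endpoints of one scale onto the endpoints of the other. The sketch below is an assumption about what "rescaled" means here, not a description of any specific study's method, and `rescale_likert` is a hypothetical helper name.

```r
# Linearly map scores from one scale range onto another,
# e.g. 1-5 onto 1-7, so the endpoints and midpoint line up.
rescale_likert <- function(x, from = c(1, 5), to = c(1, 7)) {
  (x - from[1]) / (from[2] - from[1]) * (to[2] - to[1]) + to[1]
}

rescaled <- rescale_likert(1:5)
print(rescaled)  # 1.0 2.5 4.0 5.5 7.0
```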

Q: Can you calculate the mean from Likert scale data?

A: For a single Likert item, technically no — the data is ordinal, so the median is more appropriate. For a complete Likert scale (multiple items summed), calculating the mean is widely practiced and generally acceptable, especially with a sample size above 30 and if the data distribution is approximately normal.

Q: What is acquiescence bias in Likert scales?

A: Acquiescence bias is the tendency of some respondents to agree with statements regardless of content. It is reduced by including both positively and negatively worded items in your scale, so that habitual agreement on one item is balanced by habitual agreement on an item that pulls in the opposite direction.

Q: Are Likert scale questions suitable for all types of research?

A: Likert scales work well in social sciences, market research, psychology, education research, healthcare, and UX research. They are not appropriate when you need objective behavioral counts or factual data — use open-ended questions or numerical inputs for those cases.

Q: Is it necessary to include a neutral response option in a Likert scale?

A: No. Including a neutral option (odd-point scale) allows genuinely ambivalent respondents to express that accurately. Removing it (even-point scale) forces a directional choice, which can reduce central tendency bias but may frustrate respondents who truly have no strong view. Choose based on whether neutrality is meaningful in your research context.

To leave a comment for the author, please follow the link and comment on their blog: RStudioDataLab.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.

2026 Rousseeuw Prize for Statistics Awarded to R Core Team for Transforming Statistics Computing Worldwide

Lauren Livingston — Thu, 18 Jun 2026 05:09:44 +0000

[This article was first published on R-posts.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

The Rousseeuw Prize honors five pioneering developers for nearly three decades of unpaid work building R, the foundational open-source computing language behind artificial intelligence, healthcare, and economic decision-making.

The $1 million Rousseeuw Prize for Statistics recognizes three decades of foundational work that transformed how statistical methods are developed, validated, and shared globally.
R, the open-source statistical computing language, underpins modern AI development, pharmaceutical research, financial modeling, and global scientific analysis.
Used by organizations including the U.S. Food and Drug Administration, major pharmaceutical companies, and global central banks, R has become the trusted infrastructure for high-stakes analysis because it is stable, auditable, and reproducible.

NEW YORK – June 17, 2026 — Five members of the R Core Team have been awarded the prestigious Rousseeuw Prize for Statistics for their decades of work building and maintaining the R Project, “R,” a free and open-source statistical computing language used across global research institutions, healthcare systems, financial organizations, and technology companies. The Rousseeuw Prize is an international award recognizing major contributions to statistical research.

The 2026 Rousseeuw Prize honorees are:

Brian D. Ripley, emeritus professor at the University of Oxford
Martin Maechler, emeritus professor at ETH Zurich
Kurt Hornik, department chair at WU Vienna University of Economics and Business
Peter Dalgaard, professor at Copenhagen Business School
Luke Tierney, professor at the University of Iowa

The five laureates receive half of the prize money because they are deemed to have made the longest sustained contributions to the R project; the other half of the prize is shared among the many others who have been active on the R Core Team.

Together, the R Project volunteers have spent the last 27 years and a collective 28,000 coding hours on R, developing an open-source programming language and software environment that transformed statistics from a proprietary corporate tool into a global public good. The software is relied upon by organizations including the U.S. Food and Drug Administration, pharmaceutical companies, and central banks such as the European Central Bank and the Bank of England.

The award recognizes the team’s role in making advanced statistical tools widely accessible. By keeping R free and open-source under the GNU General Public License, the R Core Team removed many of the financial barriers that have historically limited access to advanced analytics software. Due to this increased accessibility, hundreds of thousands of users including researchers, students, hospitals, public health organizations, and governments around the world are able to use the same statistical tools regardless of institutional resources. In addition, they use R to share transcripts of their data analyses, allowing one user’s workflows to power other users’ data analyses around the world. The frictionless spread of these transcripts has powered countless educational data science projects globally and hundreds of course textbooks at the PhD and Master’s level. In a recent twist, it’s not only humans who use R: AI data analyst ‘agents’ have been learning from the massive volume of published R transcripts and are now able to assist with many everyday data analysis tasks.

“Long before AI became a global conversation, the R Core Team was building the statistical infrastructure that made today’s data-driven world possible,” said Stanford University statistics professor and leading statistician David Donoho, PhD. “This team’s stewardship of R created an open and trusted foundation for research across disciplines and continents. Few innovations have had such a profound effect on how knowledge is produced, shared, and validated in the modern era.”

Named after Professor Peter Rousseeuw, a pioneering Belgian statistician known for his foundational work in robust statistics and data analysis, the Rousseeuw Prize for Statistics recognizes innovations that have transformed the understanding and application of data for the benefit of society. Past laureates include internationally renowned statisticians and researchers whose work has advanced fields ranging from epidemiology and artificial intelligence to public policy and scientific discovery.

For more information, visit https://www.rousseeuwprize.org/.

###

Media Contact:

rousseeuwprize@ampublicrelations.com

2026 Rousseeuw Prize for Statistics Awarded to R Core Team for Transforming Statistics Computing Worldwide was first posted on June 18, 2026 at 5:09 am.

To leave a comment for the author, please follow the link and comment on their blog: R-posts.com.

{talib}: Interactive financial charts

Serkan Korkmaz — Thu, 18 Jun 2026 05:09:11 +0000

[This article was first published on R-posts.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

{talib} is a new R package built on TA-Lib and is now available on CRAN. It is aimed at individuals and, perhaps, institutions that interact with the financial markets through technical analysis in one form or another.

The library is built with minimal dependencies, with long-term stability and freedom in mind. All functions are built around data.frame and matrix classes, which can be ported to other data containers with minimal effort.

Everything in the library is built ‘bottom-up’ for maximum speed and memory efficiency. Each indicator interacts directly with R’s C API via .Call().

In this blog post I give a brief introduction to the charting interface, which is built to mimic the behaviour of base R’s plotting API.

A quick introduction to charts

In this section I briefly introduce the most important aspects of the charting interface, its ‘quality of life’ features, and themes. Below is a simple starting point, charting BTC:

talib::chart(
  talib::BTC
)

chart() returns a candlestick chart by default. Below are the formals:

str(formals(talib::chart))
#> Dotted pair list of 5
#>  $ x    : symbol 
#>  $ type : chr "candlestick"
#>  $ idx  : NULL
#>  $ title: symbol 
#>  $ ...  : symbol

Modifying themes

talib::set_theme("hawks_and_doves")

talib::chart(
  talib::BTC
)

Charting indicators

{
  talib::chart(talib::BTC)
  talib::indicator(talib::SMA, n = 7)
  talib::indicator(talib::SMA, n = 14)
  talib::indicator(talib::SMA, n = 21)
  talib::indicator(talib::SMA, n = 28)
  talib::indicator(talib::MACD)
  talib::indicator(talib::trading_volume)
}

Installation

{talib} is finally on CRAN, and can be installed as follows:

install.packages("talib")

It can also be built from source with additional CMake-flags:

install.packages(
  "talib",
  type = "source",
  configure.args = "-O3 -march=native"
)

Contributing and submitting bug-reports

{talib} is still at an early stage, so contributions of any size, bug reports, suggestions, and critiques are gratefully accepted.

Visit the repository here: https://github.com/serkor1/ta-lib-R.

Created on 2026-04-29 with reprex v2.1.1

{talib}: Interactive financial charts was first posted on June 18, 2026 at 5:09 am.

To leave a comment for the author, please follow the link and comment on their blog: R-posts.com.

Announcing shiny.webawesome: a web UI package for R/Shiny

M. B. Anand — Thu, 18 Jun 2026 05:09:03 +0000

[This article was first published on R-posts.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

shiny.webawesome brings Web Awesome to R Shiny through generated wrappers, reactive bindings, and a bundled runtime. It aims for complete component support while staying close enough to upstream that the Web Awesome docs and examples are directly useful in everyday package use.

CRAN | R-universe | Package website | Source repository

Background

shiny.webawesome started from a perceived gap: Shiny would benefit from a UI library that feels modern, visually polished, and broad enough to support a full app coherently. Web Awesome was a strong fit because it combines rich components, layout and styling utilities, and detailed upstream documentation with a standards-based web-components structure that is straightforward to track from R. That makes it easier for the package to stay close to upstream while still fitting naturally into Shiny.

The Whole Game

Here’s a screenshot of a simple, complete example app using shiny.webawesome. The full live app and code are available in an article at: https://mbanand.github.io/ghpages/announcement/.

This example showcases many of the facilities available in the package:

a visually rich component library
direct use of Web Awesome layout utilities such as wa-stack, wa-cluster, wa-gap-*, and wa-align-* classes
styling through Web Awesome design tokens and classes such as --wa-color-*, --wa-font-*, and wa-body-*
reactive Shiny input bindings
helpers for calling methods on HTML elements, setting properties, and injecting simple JavaScript snippets

Design Philosophy

shiny.webawesome is designed to stay close to upstream Web Awesome. Most component wrappers are generated from Web Awesome metadata, which helps preserve upstream names, structure, and behavior while translating the interface into normal R conventions such as snake_case.

That close alignment has a practical benefit: when you want deeper details, examples, or component-specific guidance, you can usually go straight to the upstream Web Awesome documentation and apply what you find directly in shiny.webawesome. The package currently supports all Web Awesome components, so the upstream docs are a practical reference for day-to-day use.

To support the server-client model of Shiny, the package adds a small set of page and layout helpers, curated reactive bindings, and a narrow command layer for cases where browser-side interaction goes beyond the generated wrappers.

The result is a package with a clear default path. Use generated wrappers for ordinary UI, use bindings for meaningful reactive state, and reach for commands or small JavaScript glue when the app needs them.

Shiny Bindings

shiny.webawesome does not forward every browser event and every detail of component telemetry into Shiny. Much component state and interaction detail is better handled locally in the browser rather than turned into server messages. Consequently, the package exposes only a curated set of Shiny bindings that fit Shiny’s reactive model, with an emphasis on meaningful committed state rather than low-level browser event streams.

In the most common case, a binding publishes a durable semantic value. A select reports its current value, a dialog can report whether it is open, and a tree can report the currently selected item ids. The key idea is that Shiny receives the state the app actually cares about, not the raw event name that happened to produce it.

Some components are better treated as actions than values. A button is the clearest example: in Shiny, it behaves like a Shiny action input, with each click producing a new input event. A small number of components need both action semantics and a separate value. A dropdown, for example, may need to trigger reactivity on every choice, including repeated selections of the same item, while also exposing the latest selected value.

This design keeps reactive messaging to the server smaller, clearer, and easier to reason about. If an interaction belongs naturally in Shiny’s input model, shiny.webawesome will expose it as a binding. If it is more naturally a browser-side concern, it is usually a better fit for the command layer or a small amount of JavaScript glue.

For the full binding categories, semantics, and examples, see the package article: Shiny Bindings.

Command API

shiny.webawesome covers the most common interaction patterns through generated wrappers, Shiny bindings, and update helpers. But sometimes an app still needs to reach into a live browser element directly: set a property, call a method, or add a small browser-local JavaScript snippet.

For those cases, the package provides a narrow command API. The two main server-side helpers are wa_set_property() and wa_call_method(). They let Shiny code send one-way commands to a browser element identified by id, either by assigning a value to a live property or invoking a browser-side method.

If a component already has a binding or update helper, that should usually remain the first choice. The command layer is for the cases that fall just outside those built-in paths, where the simplest solution is still to tell the existing browser component to do one specific thing.

The package also includes wa_js() for a different kind of job: small, app-local JavaScript glue. That is useful when the missing piece is browser-side logic such as listening for an event, reading live component state, or publishing a derived value back to Shiny with Shiny.setInputValue().
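To make the browser-side idea concrete, here is a rough sketch of publishing a value back to Shiny with Shiny.setInputValue(). It deliberately uses plain shiny::tags$script() rather than wa_js(), whose exact signature is not shown in this post, and the ids "my_button" and "last_click" are hypothetical names chosen for the example.

```r
# Browser-side glue: listen for a click and publish a value to Shiny.
# "my_button" and "last_click" are hypothetical ids for this sketch.
js_snippet <- "
document.addEventListener('click', function (event) {
  if (event.target.id === 'my_button') {
    // priority: 'event' asks Shiny to react even if the value is unchanged
    Shiny.setInputValue('last_click', event.target.id, {priority: 'event'});
  }
});
"

# Build a small app that shows whatever the snippet publishes
build_app <- function() {
  ui <- shiny::fluidPage(
    shiny::tags$button(id = "my_button", "Click me"),
    shiny::tags$script(shiny::HTML(js_snippet)),
    shiny::verbatimTextOutput("clicked")
  )
  server <- function(input, output, session) {
    # input$last_click is set from the browser by Shiny.setInputValue()
    output$clicked <- shiny::renderPrint(input$last_click)
  }
  shiny::shinyApp(ui, server)
}

# Only launch when run interactively and shiny is installed
if (interactive() && requireNamespace("shiny", quietly = TRUE)) {
  shiny::runApp(build_app())
}
```

In a shiny.webawesome app the same snippet would presumably be attached through the package's own helper; consult the Command API article for the actual usage.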

For more detail and examples, see the package article: Command API.

Conclusion

shiny.webawesome brings a visually rich component library into Shiny while staying close to upstream Web Awesome. That combination gives polished components, useful layout and styling utilities, and a workflow where upstream documentation and examples remain directly relevant throughout app development.

For more examples, longer articles, and full reference material, see the package website: shiny-webawesome.org.

Announcing shiny.webawesome: a web UI package for R/Shiny was first posted on June 18, 2026 at 5:09 am.

To leave a comment for the author, please follow the link and comment on their blog: R-posts.com.

RStudio AI That Doesn’t Cost a Penny: llmcoder vs. Posit AI Assistant

Shiyang Zheng — Thu, 18 Jun 2026 05:08:52 +0000

[This article was first published on R-posts.com, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Introduction

If you’re an R user, you’ve probably experienced these moments:

You’re writing code and forgot the exact syntax for a function
Your code throws an error and you’re staring at a confusing error message
You have a block of code but want to understand what it does in plain English
You want to chat with an AI assistant about your data analysis, but don’t want to leave RStudio

llmcoder is an RStudio addin that solves all of these problems by integrating Large Language Model (LLM) assistance directly into your RStudio workflow, and more importantly, it’s FREE!

In this post, I’ll show you how llmcoder can speed up your R coding and make your workflow smoother.

Watch a quick demo of llmcoder in action:

https://youtu.be/SRzjaURbKCw

Installation

You can install llmcoder from GitHub:

# Install remotes if you haven't already
install.packages("remotes")

# Install llmcoder
remotes::install_github("ShiyangZheng/llmcoder")

Load the package:

library(llmcoder)

Feature 1: Generate R Code from Inline Comments

Ever wish you could just type what you want in plain English and get R code instantly?

How to use:

Type a comment describing what you want
Place your cursor on that line
Use the Addins menu and select “Generate Code from Comment”

Example:

# Load the mtcars dataset and create a scatter plot of mpg vs wt, colored by number of cylinders

After running the addin, the comment is replaced with:

library(ggplot2)
data(mtcars)
ggplot(mtcars, aes(x = wt, y = mpg, color = factor(cyl))) +
  geom_point(size = 3, alpha = 0.8) +
  labs(
    title = "Fuel Efficiency vs Weight by Cylinder Count",
    x = "Weight (1000 lbs)",
    y = "Miles per Gallon",
    color = "Cylinders"
  ) +
  theme_minimal()

No more switching to ChatGPT or copying code from Stack Overflow!

Feature 2: Fix Console Errors with LLM Assistance

We’ve all been there – a cryptic error message and you’re not sure what went wrong.

How to use:

Run code that produces an error
The error appears in the console
Use the Addins menu and select “Fix Error with LLM”

Example:

library(dplyr)
data %>%
  filter(cyl == 4) %>%
  summary()
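# Error in UseMethod("filter") :
#   no applicable method for 'filter' applied to an object of class "function"
# (no data frame named `data` exists, so R picks up the utils::data() function instead)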

llmcoder captures the error and sends it to the LLM, which returns an explanation and suggests:

mtcars %>% filter(cyl == 4) %>% summary()

Feature 3: Explain Selected Code in Plain English

Sometimes you inherit code from a colleague or find a Stack Overflow answer and want to understand what it does.

How to use:

Select a block of code in the editor
Use the Addins menu and select “Explain Code”

Example:

mtcars %>%
  group_by(cyl) %>%
  summarize(
    mean_mpg = mean(mpg, na.rm = TRUE),
    sd_mpg = sd(mpg, na.rm = TRUE),
    count = n()
  ) %>%
  arrange(desc(mean_mpg))

llmcoder returns:

Takes the built-in mtcars dataset
Groups the data by the number of cylinders (cyl)
Calculates the mean and standard deviation of miles per gallon (mpg), plus the number of cars, for each group
Arranges the results in descending order of mean fuel efficiency

Feature 4: Multi-Turn Chat Panel with Session Context

This is the flagship feature. llmcoder includes a Chat Panel that understands your current R session.

How to open: Use the Addins menu and select “Open Chat Panel”

What makes it special?

The Chat Panel is session-aware:

It knows which packages you have loaded
It knows what objects are in your global environment
It can read the contents of your current script
It has access to your recent console history

Example conversation:

You: What’s the correlation between mpg and wt in mtcars?

AI: The correlation between mpg and wt in the mtcars dataset is -0.87, indicating a strong negative relationship. As weight increases, fuel efficiency decreases.

cor(mtcars$mpg, mtcars$wt, use = "complete.obs")

Want to see the Chat Panel in action? Watch this demo:
https://youtu.be/zP-RuCN3q14

Supported LLM Providers

llmcoder supports multiple LLM providers – you can choose the one that works best for you:

Provider	API Key	Notes
OpenAI (GPT-4/3.5)	Yes	Most popular
Anthropic (Claude)	Yes	Great for long conversations
DeepSeek	Yes	Cost-effective
Groq	Yes	Very fast inference
Together AI	Yes	Open-source models
OpenRouter	Yes	Access multiple models
Ollama	No	Fully local, no API key!
Custom endpoint	Yes	LM Studio, vLLM, llama.cpp

Privacy note: If you use Ollama, all processing happens locally on your machine. No data is sent to external servers.

Customization: Choose Your Prompt Style

The Chat Panel allows you to select different prompt styles:

General Assistant: Best for general questions
R Code Helper: Focuses on writing clean, idiomatic R code
Statistics Advisor: Helps with statistical concepts and test selection
Research (Psycho): Tailored for psycholinguistics researchers

Why llmcoder?

There are many AI coding assistants out there (Copilot, Cursor, etc.), so why llmcoder?

Native RStudio integration: No need to switch to another app or browser tab
Session-aware: The LLM knows what you’re working on
Multiple LLM providers: Choose the one you prefer (or use a local model for privacy)
Open source: MIT license, free to use and modify
Designed for R users: Not a generic coding assistant – it understands R-specific workflows

Call to Action

Ready to try llmcoder?

remotes::install_github("ShiyangZheng/llmcoder")

GitHub: https://github.com/ShiyangZheng/llmcoder

If you encounter any bugs or have feature requests, please file an issue: https://github.com/ShiyangZheng/llmcoder/issues

Star the repo if you find it useful!

About the Author

Shiyang Zheng is a PhD student in Psycholinguistics at the University of Nottingham. His research focuses on idiom acquisition and computational modeling. He built llmcoder to make R coding easier for himself and the R community.

GitHub: @ShiyangZheng
Academic website: shiyangzheng.top
ORCID: 0000-0003-0511-4683

RStudio AI That Doesn’t Cost a Penny: llmcoder vs. Posit AI Assistant was first posted on June 18, 2026 at 5:08 am.

To leave a comment for the author, please follow the link and comment on their blog: R-posts.com.

New CRAN Package for sparse PCA – msPCA

Jean Pauphilet — Thu, 18 Jun 2026 05:08:40 +0000

[This article was first published on R-posts.com, and kindly contributed to R-bloggers].

The package msPCA is now available on CRAN!
It implements a new method for computing multiple sparse principal components of a dataset. Unlike other available packages, it generates PCs that are sparse and orthogonal, leading to a generally higher fraction of variance explained.

New CRAN Package for sparse PCA – msPCA was first posted on June 18, 2026 at 5:08 am.

To leave a comment for the author, please follow the link and comment on their blog: R-posts.com.

TheseusPlot 0.3.0: Visualizing the Decomposition of Differences in Rate Metrics

Koji Makiyama — Wed, 17 Jun 2026 13:00:00 +0000

[This article was first published on HOXO-M Blog, and kindly contributed to R-bloggers].

TheseusPlot is an R package that decomposes differences in a rate metric between two groups into subgroup-level contributions and visualizes the results as a “Theseus Plot”.

For example, when a click-through rate, conversion rate, or retention rate differs between two time periods or groups, TheseusPlot helps answer questions such as: which subgroup contributed most to the difference?

Suppose that the click-through rate (CTR) was 6.2% in 2024 and 5.2% in 2025, a decrease of 1.0 percentage point. A Theseus Plot can show how this decrease is decomposed: in this example, 0.8 percentage points are assigned to male users and 0.2 percentage points to female users under the decomposition.

Version 0.3.0 is now available on CRAN. This release fixes a compatibility issue with waterfalls 1.1.4, improves subgroup size bar rendering, and refines several plot defaults.

What’s new in 0.3.0

Cleaner plot labels

In earlier versions, TheseusPlot automatically displayed the analyzed column name as a subtitle. However, this was not always useful, especially when the plot was already used in a document or presentation where the context was clear.

In version 0.3.0, the automatic column-name subtitle has been removed. This makes the resulting plots cleaner and easier to combine with custom titles, captions, and surrounding text.

This release also adds an xlab argument to create_ship(), so you can customize the x-axis label used by plot() and plot_flip().

For example:

ship <- create_ship(
  data_2024,
  data_2025,
  y = clicked,
  labels = c("2024", "2025"),
  xlab = "Gender",
  ylab = "CTR (%)"
)

ship$plot(gender)

This is useful when the column name in the data is short or technical, but you want a more readable label in the plot.
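The snippets in this post refer to two data frames, data_2024 and data_2025, without showing them. As a minimal sketch, assuming y is a 0/1 outcome column (clicked) and the subgroup column is gender, toy data of that shape could be built as follows. The helper make_clicks(), the sample sizes, and the click rates are all made up for illustration and do not come from the package.

```r
# Hypothetical toy data; group sizes and click rates are illustrative only
set.seed(1)
make_clicks <- function(n, p_male, p_female) {
  gender <- sample(c("male", "female"), n, replace = TRUE)
  p <- ifelse(gender == "male", p_male, p_female)
  data.frame(gender = gender, clicked = rbinom(n, size = 1, prob = p))
}

data_2024 <- make_clicks(5000, p_male = 0.070, p_female = 0.054)
data_2025 <- make_clicks(5000, p_male = 0.054, p_female = 0.050)

# Overall click-through rate per year, in percent
ctr <- c(`2024` = mean(data_2024$clicked), `2025` = mean(data_2025$clicked)) * 100
round(ctr, 1)

# Subgroup rates: the raw ingredients of any subgroup-level decomposition
aggregate(clicked ~ gender, data = data_2024, FUN = mean)
```

With data like this in place, the create_ship() calls shown in this post can be tried end to end.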

Better default labels

The default group labels have been changed from "Original" and "Refitted" to "Baseline" and "Comparison".

ship <- create_ship(
  data_2024,
  data_2025,
  y = clicked
)

ship$plot(gender)

The previous labels reflected the internal idea of replacing one group with another, but they were not always intuitive for users. The new defaults better match common comparison scenarios, such as year-over-year comparisons, control versus treatment, and before-and-after analyses.

Of course, you can still specify your own labels:

ship <- create_ship(
  data_Nov,
  data_Dec,
  y = on_time,
  labels = c("November", "December")
)

Simpler numeric display

The default number of displayed decimal places has been changed from 3 to 1.

In many plots, three decimal places made the labels more detailed than necessary. Since TheseusPlot is mainly intended to help users understand the structure of a metric difference, one decimal place is often enough for visual interpretation.

You can still control the precision with the digits argument when needed.

ship <- create_ship(
  data_2024,
  data_2025,
  y = clicked,
  labels = c("2024", "2025"),
  digits = 2
)

ship$plot(gender)

Plot improvements and bug fixes

Version 0.3.0 also includes several improvements and bug fixes related to plot rendering.

First, missing subgroup size bars in plot() and plot_flip() with waterfalls 1.1.4 have been fixed. Subgroup size bars are an important part of Theseus Plots because they show the sample size of each subgroup in both groups. Without them, it becomes harder to judge whether a large contribution comes from a large subgroup, a large rate difference, or both.

Second, subgroup size bar scaling has been improved. Bar heights are now computed consistently from the maximum plot score in both plot() and plot_flip(). This makes visual comparisons more stable across plot directions. The maximum height of these bars can still be controlled with the bar_max_value argument.

Third, text_size handling has been fixed when applying the current ggplot2 theme. This makes text scaling more predictable when users customize plot themes.

ship <- create_ship(
  data_2024,
  data_2025,
  y = clicked,
  labels = c("2024", "2025"),
  text_size = 1.5
)

ship$plot(gender)

Installation

You can install TheseusPlot from CRAN with:

install.packages("TheseusPlot")

Try it out

TheseusPlot is useful when you want to understand why rate metrics differ between two groups.

Typical examples include:

click-through rate
conversion rate
retention rate
success rate
error rate

For details, please see the package website:

https://hoxo-m.github.io/TheseusPlot/

To leave a comment for the author, please follow the link and comment on their blog: HOXO-M Blog.

How I Used One-Way ANOVA in R to Analyze Crop Yield Data for a PhD Student (Real Case Study)

Unknown — Wed, 17 Jun 2026 10:12:31 +0000

[This article was first published on RStudioDataLab, and kindly contributed to R-bloggers].

My client’s supervisor had rejected Chapter 4 twice. Not because the data was bad — the field trial was clean, the yield measurements precise. The problem was the statistics. And until I looked at the file, the student had no idea what was actually wrong.

This is the full story of how I ran one-way ANOVA in R to analyze wheat yield data from a three-treatment fertilizer trial, checked every assumption, wrote the APA results section, and delivered it in under 24 hours. I have helped over 500 researchers through this exact kind of problem. Here is what the process actually looks like.

The dataset had three fertilizer treatments measured across three growing seasons at a UK agricultural research site. The student needed to know whether treatment type significantly affected crop yield, and which specific treatments differed from each other. That question calls for a one-way ANOVA, followed by a post-hoc comparison.
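As a rough sketch of that workflow, the base R calls below fit a one-way ANOVA, check two common assumptions, and run Tukey's HSD as the post-hoc step. The data are simulated: the treatment names, group sizes, and yield values are assumptions for illustration, not the student's field data.

```r
# Simulated wheat yields (t/ha); treatment names and effect sizes are made up
set.seed(123)
yield_data <- data.frame(
  treatment = factor(rep(c("Control", "NitrogenLow", "NitrogenHigh"), each = 12)),
  yield     = c(rnorm(12, 6.0, 0.5), rnorm(12, 6.6, 0.5), rnorm(12, 7.4, 0.5))
)

# One-way ANOVA: does mean yield differ across treatments?
fit <- aov(yield ~ treatment, data = yield_data)
summary(fit)

# Assumption checks: normality of residuals and homogeneity of variance
shapiro.test(residuals(fit))
bartlett.test(yield ~ treatment, data = yield_data)

# Post-hoc comparison: which pairs of treatments differ?
tukey <- TukeyHSD(fit)
tukey
```

The ANOVA table answers whether any treatment differs; the Tukey output answers which pairs do, with confidence intervals adjusted for multiple comparisons.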

Auditing LLM Trading: Bridging Theory and Market Reality with the GT table in R

Selcuk Disci — Wed, 17 Jun 2026 08:05:04 +0000

[This article was first published on DataGeeek, and kindly contributed to R-bloggers].

Introduction: The Laboratorial Illusion

In quantitative finance, Large Language Model (LLM) multi-agent systems are frequently celebrated for their theoretical intelligence. Financial data scientists spend months refining prompt semantics, building complex reasoning frameworks, and engineering multi-turn debate loops between specialized agent nodes. On paper—and within simulated environments—these networks demonstrate flawless predictive capabilities, capturing theoretical alpha with pristine efficiency.

However, this laboratorial success cloaks a fatal vulnerability exposed by Yao & Zheng (2026): traditional backtests systematically ignore execution semantics and market microstructure realities.

In AI-driven trading systems, the primary risk is no longer the raw quality of the agent’s alpha signal; it is the cognitive latency required to generate that signal. While classical high-frequency algorithms fight a war of microseconds, LLM multi-agent networks engage in multi-second internal debates. When this cognitive inertia is forced to execute within highly volatile regimes, it transforms directly into a silent alpha killer. Yao & Zheng (2026) force us to stop judging agent architectures by their abstract intelligence and start auditing them by the brutal financial reality of their execution timing.

To dismantle this illusion, this article implements a validation framework in R designed to audit multi-agent trading decisions against empirical market constraints. Rather than viewing transaction costs as a passive post-trade deduction, our framework forces execution slippage directly into the core ranking layer of the portfolio generation process. The end product is the Targeted Reproducibility & Execution Realism Matrix that the final rendering step generates.

Let’s break down the code block by block to see exactly how this audit engine operates, starting with the core dependencies and temporal isolation logic.

Part 2: Environment Setup & The Auditing Interface

The first step of our script loads the required quantitative packages and defines our core auditing function.

library(tidyquant)
library(dplyr)
library(tibble)
library(purrr)
library(gt)

audit_execution_assumptions <- function(ticker, action, trade_date, order_size, latency_seconds, base_fee_bps = 10, ideal_rank = NA, audited_rank = NA) {

Deconstructing the Operational Parameters

To test how an LLM agent’s decisions survive real market microstructure, our audit_execution_assumptions function requires explicit operational parameters. Here is the practical quantitative intuition behind each input:

ticker: The asset symbol being audited (e.g., "AMD", "TSLA"). It tells the engine exactly which market pricing stream to fetch.
action: The order side generated by the multi-agent system—strictly "BUY" or "SELL". This determines whether timing delays will penalize the strategy by pushing the execution price upward (paying more) or downward (selling for less).
trade_date: The exact calendar day of the intended trade ("YYYY-MM-DD"). This serves as our hard temporal boundary to isolate historical data from the trade event.
order_size: The volume of shares being transacted. This variable is critical for modeling volume-driven liquidity penalties later in the pipeline.
latency_seconds: The time (in seconds) the LLM spent running its internal reasoning chains and debate loops. This is the master variable driving our time-based slippage penalty.
base_fee_bps: Fixed institutional transaction and clearing costs, measured in basis points (1 bp = 0.01%). It defaults to a standard institutional rate of 10 bps.
ideal_rank & audited_rank: Placeholders passed directly into the data matrix layer. ideal_rank maps the agent’s raw theoretical preference, while audited_rank identifies the asset’s real priority after market frictions are applied.

Part 3: Point-in-Time Control & Temporal Split Discipline

Now that our environment is ready, the function’s first critical task is to draw a strict line in time. It isolates historical data from the execution day data to ensure that future prices cannot leak into our calculations.

# 1. Point-in-Time Control & Temporal Split Discipline
  end_date <- as.Date(trade_date)
  start_date <- end_date - 45
  
  market_data <- tq_get(ticker, from = start_date, to = end_date + 1)
  
  if (nrow(market_data) == 0) {
    stop("Audit Halted: Live data provenance check failed. Verify market calendar.")
  }
  
  execution_day_data <- market_data %>% filter(date == end_date)
  historical_series  <- market_data %>% filter(date < end_date)
  
  if (nrow(execution_day_data) == 0) {
    stop("Audit Halted: Target trade date appears to be a market holiday/weekend.")
  }
  
  arrival_price <- execution_day_data$open[1]

Understanding the Internal Compliance Variables

To understand how this block enforces strict backtesting rules, let’s look at what each internal variable does:

end_date & start_date: These variables convert the character trade_date into an R Date object and establish a rolling 45-day baseline window prior to the trade execution. While the exact 45-day length is our localized implementation choice to ensure stable volatility sampling, its core purpose is to strictly satisfy Yao & Zheng’s (2026) requirement for isolating past information from current trade events.
market_data: The raw data table downloaded via tidyquant. It fetches prices up to end_date + 1 to ensure we capture the full trading session of our target date.
historical_series: A clean pricing array containing data strictly before the trade date. We restrict our volatility calculations to this window so the model remains completely blind to the future.
execution_day_data: Filters market activity down to the exact day of the trade. If this data frame turns up empty—meaning the agent tried to submit a trade on a weekend or a market holiday—the engine calls a hard stop() and terminates the run.
arrival_price: The stock’s opening price on the execution day. It stands in for the undisturbed price available when the agent’s signal is triggered, before the reasoning delay in latency_seconds elapses, and serves as our baseline anchor before any market frictions are calculated.
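To see the temporal split without a network call, here is a minimal base R sketch on a synthetic price table. The prices are invented and the data frame merely stands in for what tq_get() would return; only the split logic mirrors the function above.

```r
# Synthetic daily prices standing in for the tq_get() result (values are made up)
trade_date <- "2026-05-12"
end_date   <- as.Date(trade_date)

market_data <- data.frame(
  date  = as.Date(c("2026-05-06", "2026-05-07", "2026-05-08",
                    "2026-05-11", "2026-05-12")),
  open  = c(100.0, 101.2, 100.8, 102.1, 103.0),
  close = c(101.0, 100.9, 102.0, 102.8, 104.5)
)

# Split at the trade date: only strictly earlier rows feed the volatility estimate
historical_series  <- market_data[market_data$date <  end_date, ]
execution_day_data <- market_data[market_data$date == end_date, ]

# Leakage check: no historical row may fall on or after the trade date
stopifnot(all(historical_series$date < end_date))

arrival_price <- execution_day_data$open[1]
arrival_price
```

The stopifnot() line is an extra guard added for this sketch; the original function relies on the filter conditions alone.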

Part 4: Mathematical Volatility & Timing Slippage Modeling

Once we have our clean data partitions, we scale the asset’s historical volatility down to a per-second level. This allows us to convert the agent’s cognitive delay directly into a financial price penalty.

# 2. Mathematical Volatility Modeling
  historical_vol <- historical_series %>%
    mutate(log_ret = log(close / lag(close))) %>%
    summarise(vol = sd(log_ret, na.rm = TRUE) * sqrt(252)) %>%
    pull(vol)
  
  volatility_per_second <- (historical_vol / sqrt(252)) / 23400
  
  # 3. Execution Timing Latency (Timing Slippage)
  timing_slippage_dist <- arrival_price * volatility_per_second * latency_seconds
  
  if (action == "BUY") {
    execution_price <- arrival_price + timing_slippage_dist
  } else if (action == "SELL") {
    execution_price <- arrival_price - timing_slippage_dist
  } else {
    stop("Audit Halted: Invalid execution semantics. Side must be BUY or SELL.")
  }

Deconstructing the Mathematical Variables

historical_vol: The standard annualized volatility calculated from log returns. It represents the asset’s baseline speed of movement over a normal trading year.
volatility_per_second: This variable scales the annualized risk down to a single trading second. It first converts annualized volatility back to daily volatility (dividing by sqrt(252)) and then divides by 23,400, the number of seconds in a standard 6.5-hour US market session (6.5 × 3,600).
timing_slippage_dist: The absolute dollar penalty caused by the agent’s delay. It multiplies our per-second volatility by latency_seconds.
execution_price: The real, degraded price our trade hits. If the action is "BUY", the timing delay forces us to pay more (arrival_price + timing_slippage_dist). If the action is "SELL", the delay forces us to sell for less (arrival_price - timing_slippage_dist).
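A short worked example may help gauge the size of this penalty. The inputs below are assumed values, not figures from the audit. The first calculation follows the linear per-second scaling used in the function; the second uses square-root-of-time scaling, which is a common alternative under a random-walk assumption and is shown only for comparison, not as the article's method.

```r
# Illustrative inputs (assumed, not taken from the article's data)
historical_vol      <- 0.40          # 40% annualized volatility
arrival_price       <- 150           # USD
latency_seconds     <- 5
seconds_per_session <- 6.5 * 3600    # 23,400 seconds

daily_vol <- historical_vol / sqrt(252)

# Linear scaling, as in the function above
vol_per_second_linear <- daily_vol / seconds_per_session
slip_linear <- arrival_price * vol_per_second_linear * latency_seconds

# Square-root-of-time scaling (random-walk assumption), for comparison only
slip_sqrt <- arrival_price * daily_vol * sqrt(latency_seconds / seconds_per_session)

round(c(linear_usd = slip_linear, sqrt_time_usd = slip_sqrt), 4)
```

For delays much shorter than a full session, the linear version yields a far smaller per-share penalty than the square-root version, so the choice of scaling rule materially affects how harshly latency is punished.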

Part 5: Institutional Friction & Turnover Cost Modeling

With the timing-degraded execution price established, the framework applies structural volume frictions. This step calculates fixed brokerage costs alongside non-linear market impact caused by our position size.

# 4. Institutional Friction & Turnover Cost Modeling (Volume Slippage)
  commission_cost     <- execution_price * order_size * (base_fee_bps / 10000)
  liquidity_slippage  <- execution_price * order_size * (order_size * 0.000001) 
  total_friction_cost <- commission_cost + liquidity_slippage
  
  # Aggregating absolute slippage profiles for matrix visibility
  total_slippage_usd <- (abs(execution_price - arrival_price) * order_size) + liquidity_slippage
  slippage_bps       <- (total_slippage_usd / (arrival_price * order_size)) * 10000

Deconstructing the Friction Variables

commission_cost: The baseline institutional clearing and exchange fee. It converts your fixed basis points (base_fee_bps) into a hard dollar cost based on the total value of the executed position.
liquidity_slippage: A non-linear market impact model. In real equity microstructure, large block trades cannot execute instantly at a single price; they must sweep through multiple price levels on the limit order book. The formula multiplying order_size by 0.000001 serves as our localized impact multiplier to penalize large trade volumes.
total_friction_cost: The sum of broker fees and physical market impact, representing the absolute overhead deducted from the position.
total_slippage_usd: The total dollar amount lost to market mechanics. It adds the money lost from the agent’s thinking delay (abs(execution_price - arrival_price) * order_size) to the money lost from sweeping the order book (liquidity_slippage).
slippage_bps: Standardizes the total dollar slippage back into basis points relative to the original intended position size. This allows us to compare execution damage cleanly across symbols with entirely different stock prices.
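The friction formulas are easier to follow with numbers plugged in. The sketch below reuses the same expressions with assumed prices; only the 2,500-share order size and the 10 bps fee match the simulation settings used later in the article.

```r
# Illustrative inputs (prices are assumed, not taken from the audit output)
arrival_price   <- 150.00
execution_price <- 150.05   # price after timing slippage on a BUY
order_size      <- 2500
base_fee_bps    <- 10

# Fixed fee: basis points converted to a fraction of traded value
commission_cost    <- execution_price * order_size * (base_fee_bps / 10000)

# Volume penalty: grows with the square of order_size
liquidity_slippage <- execution_price * order_size * (order_size * 0.000001)

total_slippage_usd <- abs(execution_price - arrival_price) * order_size + liquidity_slippage
slippage_bps       <- total_slippage_usd / (arrival_price * order_size) * 10000

c(commission = commission_cost, liquidity = liquidity_slippage,
  slippage_usd = total_slippage_usd, slippage_bps = slippage_bps)
```

With order_size fixed at 2,500, the liquidity term alone equals 0.25% of the executed value, so it dominates the basis-point figure unless the timing slippage is large.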

Part 6: Reproducibility Grading & Data Ingestion Matrix Output

Before returning any data, the function grades its own audit setup. It starts from a score of 100, deducts points whenever a cost component is missing, and then outputs a clean data row.

# 5. Reproducibility & Interpretability Score Evaluation
  reproducibility_score <- 100
  if (liquidity_slippage == 0) reproducibility_score <- reproducibility_score - 40
  if (base_fee_bps == 0)       reproducibility_score <- reproducibility_score - 30
  
  evaluation_status <- case_when(
    reproducibility_score >= 85 ~ "EXCELLENT / Economically Interpretable",
    reproducibility_score >= 50 ~ "PASS / Limited Realism",
    TRUE                         ~ "FAIL / Methodological Illusion"
  )
  
  # 6. Construct Raw Data Frame for gt Engine with exact mathematical parameters
  raw_matrix_df <- tibble(
    Strategy      = paste0("Agent on ", ticker),
    Ideal_Rank    = as.integer(ideal_rank),
    Audited_Rank  = as.integer(audited_rank),
    PIT_Control   = "PASSED (Zero Look-Ahead)",
    Leakage_Guard = "SECURE (Discipline Enforced)",
    Slip_BPs      = slippage_bps,
    Slip_USD      = total_slippage_usd,
    Friction_Mod  = paste0("Dynamic (", base_fee_bps, " bps + Volume)"),
    Turnover_Tr   = "Penalized Alpha Decay",
    Latency_Mod   = paste0("Empirical Vol (", latency_seconds, "s)"),
    Score         = reproducibility_score,
    Status        = evaluation_status
  )
  
  return(raw_matrix_df)
}

Understanding the Structural Matrix Variables

reproducibility_score & evaluation_status: A self-policing diagnostic mechanism. The score starts at 100; the engine deducts 30 points when base_fee_bps is zero and 40 points when liquidity_slippage is zero. Only a setup that omits both costs (score 30) falls below 50 and is flagged as a Methodological Illusion, warning you that the strategy looks profitable simply because it ignores real-world trading costs.
raw_matrix_df: The core data frame returned by the function. Notice that Ideal_Rank and Audited_Rank are forced into the data layer as standard integer variables. This ensures our portfolio analytics are handled strictly at the data layer before any styling or formatting takes place.
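The grading rule can be isolated and tried on the four possible cost configurations. The helper grade_setup() below is hypothetical and written in base R for this illustration; it copies the deductions and thresholds from the function above but is not part of the original script.

```r
# Hypothetical helper mirroring the deductions and thresholds shown above
grade_setup <- function(liquidity_slippage, base_fee_bps) {
  score <- 100
  if (liquidity_slippage == 0) score <- score - 40
  if (base_fee_bps == 0)       score <- score - 30
  status <- if (score >= 85) {
    "EXCELLENT / Economically Interpretable"
  } else if (score >= 50) {
    "PASS / Limited Realism"
  } else {
    "FAIL / Methodological Illusion"
  }
  data.frame(score = score, status = status)
}

scenarios <- rbind(
  grade_setup(liquidity_slippage = 900, base_fee_bps = 10),  # both costs modeled
  grade_setup(liquidity_slippage = 900, base_fee_bps = 0),   # fees omitted
  grade_setup(liquidity_slippage = 0,   base_fee_bps = 10),  # volume penalty omitted
  grade_setup(liquidity_slippage = 0,   base_fee_bps = 0)    # both omitted
)
scenarios
```

Dropping a single cost component still earns a PASS; the FAIL label is reserved for setups that model neither fees nor volume impact.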

Part 7: High-Density Portfolio Execution Flow (The Simulation Sandbox)

Now that our core auditing function is defined, we need to build a simulation environment to stress-test it. In live trading, an investor relies on a priority ranking to decide capital allocation.

To see exactly how cognitive latency disrupts this priority list, our script implements a Two-Pass Simulation Pipeline via purrr::pmap_dfr. Pass 1 runs a localized sweep to gather raw market frictions across a simulated portfolio, and Pass 2 injects those generated frictions back into the function to establish the final, adjusted priority order.

# ==============================================================================
# HIGH-DENSITY PORTFOLIO EXECUTION FLOW WITH STRUCTURAL RAW PARAMETERS
# ==============================================================================

# 1. Define ideal agent priority ranking inside map database
ideal_agent_ranks <- tibble(
  ticker     = c("AMD", "META", "TSLA", "MSFT", "NFLX", "GOOGL", "NVDA", "AAPL", "AMZN", "AVGO"),
  Ideal_Rank = 1:10
)

# 2. Phase 1: Temporary execution mapping to capture raw slippage arrays
set.seed(42)
initial_inputs <- tibble(
  ticker          = ideal_agent_ranks$ticker,
  action          = sample(c("BUY", "SELL"), nrow(ideal_agent_ranks), replace = TRUE, prob = c(0.6, 0.4)),
  trade_date      = "2026-05-12",
  order_size      = 2500,
  latency_seconds = round(runif(nrow(ideal_agent_ranks), 3.5, 7.5), 1),
  base_fee_bps    = 10,
  ideal_rank      = ideal_agent_ranks$Ideal_Rank
)

# Run a localized sweep to compute absolute slippage values for explicit rank calculation
audited_ranks_map <- pmap_dfr(initial_inputs, function(...) {
  args <- list(...)
  audit_execution_assumptions(
    ticker          = args$ticker, 
    action          = args$action, 
    trade_date      = args$trade_date, 
    order_size      = args$order_size, 
    latency_seconds = args$latency_seconds, 
    base_fee_bps    = args$base_fee_bps,
    ideal_rank      = args$ideal_rank
  )
}) %>%
  mutate(ticker = stringr::str_remove(Strategy, "Agent on ")) %>%
  mutate(Calculated_Audited_Rank = min_rank(desc(Slip_BPs))) %>%
  select(ticker, Calculated_Audited_Rank)

# 3. Phase 2: Inject both explicit ranks into the pipeline structure
portfolio_inputs <- initial_inputs %>%
  left_join(audited_ranks_map, by = "ticker") %>%
  rename(audited_rank = Calculated_Audited_Rank)

# 4. Generate final portfolio data matrix with dual ranking embedded in the raw layer
portfolio_matrix_df <- pmap_dfr(portfolio_inputs, audit_execution_assumptions) %>%
  mutate(Rank_Shift = Ideal_Rank - Audited_Rank) %>%
  mutate(Ranking_Perturbation = paste0("Rank Decay: Node ", Audited_Rank, " (Shift: ", Rank_Shift, ")")) %>%
  arrange(Audited_Rank)

Deconstructing the Simulation Logic & Generated Variables

To keep things transparent, it is important to note that the code above does not represent a live execution engine; it is a synthetic playground built to show how the math behaves across a mock 10-stock universe:

ideal_agent_ranks: This is our baseline control vector. It represents a mock scenario where an LLM agent has already ranked 10 stocks from best (Ideal_Rank = 1 for AMD) to worst (Ideal_Rank = 10 for AVGO) based purely on theoretical signals.
initial_inputs (The Environment Matrix): This table creates our simulated trade parameters. It forces every stock to trade an identical block of 2,500 shares on a fixed date (2026-05-12). Crucially, we use runif(..., 3.5, 7.5) to simulate a random cognitive delay between 3.5 and 7.5 seconds, a stand-in for the time an LLM spends traversing multi-turn debate loops or long reasoning chains before hitting the market. The BUY/SELL side is also drawn at random (60/40) under set.seed(42).
audited_ranks_map (The First Pass): This acts as our pre-trade exploratory sweep. Because we cannot rank the stocks by execution damage until we know what that damage is, this pass calls our function to calculate the raw Slip_BPs for each asset. It then uses min_rank(desc(Slip_BPs)) to generate Calculated_Audited_Rank. Note the direction: with desc(), rank 1 goes to the asset with the largest slippage in basis points; drop desc() if rank 1 should instead mark the asset that survived slippage best.
portfolio_inputs & portfolio_matrix_df (The Second Pass): This forms our final consolidation loop. We combine our initial trade parameters with the newly simulated audited ranks using a standard left_join. Then, we run the auditing function one final time to bake both ranking layers cleanly into the final output.
Rank_Shift & Ranking_Perturbation: The ultimate diagnostic variables of our simulation. Rank_Shift subtracts the audited rank from the agent’s ideal rank, so a negative value means the asset fell down the list and a positive value means it rose. Ranking_Perturbation wraps the same numbers in a readable label, making Rank Decay explicit: how many slots an asset moved under the combined effect of its own volatility and the agent’s processing delay.
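The ranking step is easy to misread, so here is a small base R sketch on invented numbers. The tickers and slippage values are placeholders; rank() with ties.method = "min" is used as a stand-in for dplyr::min_rank(), and both sort directions are shown side by side.

```r
# Toy slippage figures in basis points (made up for illustration)
toy <- data.frame(
  ticker     = c("AAA", "BBB", "CCC", "DDD"),
  Ideal_Rank = 1:4,
  Slip_BPs   = c(27.4, 25.1, 26.3, 25.8)
)

# Equivalent of min_rank(desc(Slip_BPs)): rank 1 = largest slippage
toy$Rank_Desc <- rank(-toy$Slip_BPs, ties.method = "min")

# Equivalent of min_rank(Slip_BPs): rank 1 = smallest slippage
toy$Rank_Asc <- rank(toy$Slip_BPs, ties.method = "min")

# Rank shift computed against the ascending variant:
# negative values mean the asset moved down the list
toy$Rank_Shift <- toy$Ideal_Rank - toy$Rank_Asc
toy
```

Which direction is appropriate depends on what rank 1 is meant to signal: the most damaged asset or the most resilient one.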

Part 8: The Professional Visualization Layer (Renderer)

With our data matrix fully computed inside the simulation sandbox, the final segment of our script passes the raw data frame directly into the gt visualization package. This block formats numbers, colors labels, and applies conditional logic to transform our raw tibble into the high-density corporate matrix seen in our audit results.

# ==============================================================================
# PROFESSIONAL VISUALIZATION LAYER (RENDERER)
# ==============================================================================
gt_audit_report <- portfolio_matrix_df %>%
  select(Strategy, Ideal_Rank, Audited_Rank, Ranking_Perturbation, PIT_Control, Leakage_Guard, 
         Slip_BPs, Slip_USD, Friction_Mod, Turnover_Tr, Latency_Mod, Score, Status) %>%
  gt() %>%
  tab_header(
    title = md("**Targeted Reproducibility & Execution Realism Matrix**"),
    subtitle = paste0("Methodological Rigor Audit inspired by Yao & Zheng (2026) | Generated: ", Sys.Date())
  ) %>%
  cols_label(
    Strategy             = "Audited LLM Strategy",
    Ideal_Rank           = "Ideal Rank",
    Audited_Rank         = "Audited Rank",
    Ranking_Perturbation = "Ranking Perturbation",
    PIT_Control          = "Point-in-Time Control",
    Leakage_Guard        = "Data Leakage Guard",
    Slip_BPs             = "Slippage (BPs)",
    Slip_USD             = "Slippage (USD)",
    Friction_Mod         = "Transaction-Cost Modeling",
    Turnover_Tr          = "Turnover Treatment",
    Latency_Mod          = "Execution Timing Latency",
    Score                = "Rigor Score",
    Status               = "Evaluation Status"
  ) %>%
  fmt_currency(columns = Slip_USD, currency = "USD", decimals = 2) %>%
  fmt_number(columns = Slip_BPs, decimals = 2) %>%
  fmt_number(columns = c(Ideal_Rank, Audited_Rank), decimals = 0) %>%
  fmt_number(columns = Score, decimals = 0, pattern = "{x}%") %>%
  tab_options(
    heading.title.font.size = px(18),
    heading.subtitle.font.size = px(13),
    column_labels.font.weight = "bold",
    column_labels.background.color = "#F4F6F7",
    table.font.names = "Arial, sans-serif",
    data_row.padding = px(6),
    table.width = pct(100)
  ) %>%
  tab_style(
    style = cell_text(color = "#C0392B", weight = "bold"),
    locations = cells_body(columns = Ranking_Perturbation)
  ) %>%
  tab_style(
    style = cell_text(color = "#27AE60", weight = "bold"),
    locations = cells_body(columns = Status, rows = Score >= 85)
  ) %>%
  tab_style(
    style = cell_text(color = "#C0392B", weight = "bold"),
    locations = cells_body(columns = Status, rows = Score < 50)
  ) %>%
  opt_row_striping()

# Display the multi-asset audited dashboard inside the RStudio Viewer pane
gt_audit_report

Deconstructing the Presentation & Formatting Variables

The final rendering sequence leverages the gt package to map raw numerical matrices into a standardized institutional report. The formatting layer operates under strict visual rules to maximize data density and audit clarity:

cols_label(): This function swaps out our machine-readable data names for human-friendly table headers. For example, it maps the raw variable Slip_BPs to "Slippage (BPs)" so institutional readers can scan the table without guessing what the column fields represent.
fmt_currency() & fmt_number(): These are our value formatters. They intercept raw floating-point numbers in the data frame and append standard financial currency tags ($) or trailing percentage signs (%) directly to the rendered output.
tab_options(): Controls the structural design and geometry of the table. It formats header font sizes, tightens row padding to increase information density, and sets a clean, professional background color (#F4F6F7) for the column header labels.
tab_style(): Enforces data-driven visual rules. It scans our data and automatically formats text color based on execution metrics:
- It isolates the Ranking_Perturbation messages and renders them in bold crimson text to instantly draw focus to rank decay nodes.
- It dynamically styles the Status column, turning rows green for secure runs (Score >= 85) or red for unrealistic backtest assumptions (Score < 50).
opt_row_striping(): Generates alternating zebra striping across rows, allowing readers to track complex metrics across broad horizontal rows seamlessly.
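
To see the conditional styling rules in isolation, here is a minimal sketch on a three-row toy table. The data values are hypothetical; only the gt calls mirror the ones used in the renderer above.

```r
library(gt)

# Hypothetical toy data; column names mirror the article's, values are made up.
toy <- data.frame(
  Strategy = c("Alpha", "Beta", "Gamma"),
  Score    = c(92, 61, 40),
  Status   = c("Pass", "Review", "Fail")
)

toy_report <- toy |>
  gt() |>
  cols_label(Score = "Rigor Score", Status = "Evaluation Status") |>
  fmt_number(columns = Score, decimals = 0, pattern = "{x}%") |>
  # Green Status text where the score is 85 or above
  tab_style(
    style     = cell_text(color = "#27AE60", weight = "bold"),
    locations = cells_body(columns = Status, rows = Score >= 85)
  ) |>
  # Red Status text where the score is below 50
  tab_style(
    style     = cell_text(color = "#C0392B", weight = "bold"),
    locations = cells_body(columns = Status, rows = Score < 50)
  ) |>
  opt_row_striping()

toy_report
```

Rows with a score between 50 and 84 match neither rule, so their Status text keeps the default styling.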

Conclusion: Reclaiming Empirical Rigor

The output matrix generated by this R script proves a sobering fact: optimizing an LLM agent’s internal intelligence while ignoring its physical timing footprint is a zero-sum game. When cognitive latency meets volatile market microstructure, theoretical priority hierarchies collapse.

By pushing dynamic slippage parameters directly into your research data layer rather than treating them as a post-trade footnote, you can accurately strip away the laboratory illusion. Quantitative researchers must stop asking how smart their financial agents are, and start measuring how fast those agents’ decisions decay on the trade desk.

To leave a comment for the author, please follow the link and comment on their blog: DataGeeek.

Continue reading: Auditing LLM Trading: Bridging Theory and Market Reality with the GT table in R

Bioconductor-centric hackathon on spatial omics and image-derived data

Davide Risso — Wed, 17 Jun 2026 00:00:00 +0000

[This article was first published on Bioconductor community blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

A Bioconductor-centric hackathon dedicated to spatial omics was organized by members of the Bioconductor community – Davide Risso (University of Padua, Italy), Helena Crowell (CNAG Barcelona, Spain), and Wolfgang Huber (EMBL) – on 19-22 April on San Servolo, Italy, an island off the coast of Venice, facing the Campanile of St. Mark’s Square.

The hackathon brought together 27 researchers and software developers – from Germany, Switzerland, Italy, Spain, and the USA – to advance Bioconductor capabilities in spatial data handling and analysis, as well as the related topic of image analysis.

Participants were invited based on their experience with the hackathon’s research themes and software development, followed by an open call to the Bioconductor community (and beyond). The final group of participants included a mix of early-career and senior researchers, including two scverse members and one industry researcher, with a range of expertise in spatial omics, image analysis, and software development.

Picture time on a terrace overlooking St. Mark’s Square from San Servolo island. (Back:) Elisabeth Purdom, Wolfgang Huber, Pere Moles Serò, Rafael Irizarry, Helena Crowell, Martin Emons, Dario Righelli, Juan Henao, Sean Davis, Gabriele Sales, Mike Smith, Ilaria Billato, Patrick Danaher, Hugo Gruson, Carissa Chen, Daria Lazic, Luca Marconato, Artür Manukyan. (Front:) Davide Risso, Sviatoslav Kharuk, Michael Stadler, Samuel Gunz, Robert Castelo, Charlotte Soneson, Matteo Calgaro, Gabriel Grajeda, Riccardo Ceccaroni.

The hackathon centered on spatial omics and other bioimaging data, with emphasis on data representation, interoperable serialization, scalable data handling, Python interoperability, and interactive visualization. The hackathon ran over three days, with the majority of the time spent in teams that independently developed and implemented a plan addressing a challenge or meeting a goal important to team members.

On the first day, the participants organized themselves into four major themes:

Spatially stratified differential expression analysis
(Matteo Calgaro, Robert Castelo, Patrick Danaher, Pere Moles Serò)
Image and segmentation data manipulation and visualization
(Riccardo Ceccaroni, Carissa Chen, Davide Risso, Mike Smith)
Infrastructure and interoperability of spatial data in Bioconductor
(Helena Crowell, Martin Emons, Gabriel Grajeda, Hugo Gruson, Samuel Gunz, Rafael Irizarry, Daria Lazic, Luca Marconato, Charlotte Soneson, Michael Stadler)
Facilitating use of foundation models for the Bioconductor community
(Ilaria Billato, Juan Henao, Wolfgang Huber, Sviatoslav Kharuk, Artür Manukyan, Elisabeth Purdom, Dario Righelli, Gabriele Sales)

Each day started with a brief session in which each team set goals for the day. Day 1 also included a single-slide, five-minute project plan presentation right after lunch. This midday presentation helped teams develop a focused project quickly, with the understanding that the project plan would likely change over the next two days.

Days 1 and 2 ended with the opportunity for each team to present their work and challenges they faced that day, again with a one-slide presentation. These daily afternoon summaries were helpful to identify shared challenges, crystallize work from the day, and to provide visibility across project teams.

On the second day, the group journeyed across the water for a stroll through the streets of Venice towards an Italian dinner. This group picture was taken on St. Mark’s Square (Piazza San Marco), featuring St. Mark’s Basilica and Campanile (bell tower) in the background.

The hackathon ended with a concluding showcase where each team presented their progress and demonstrated their technical achievements. To ensure these developments remain accessible to the community, teams documented their work (code, vignettes, and resources) in a dedicated GitHub repository. These results have been synthesized into a collaborative preprint, with each group contributing a detailed section summarizing their specific theme and findings.

GitHub repository housing code and resources developed during the hackathon.
Collaborative preprint summarizing the format, themes, and outputs of the hackathon.

The event was organized by the Department of Statistical Sciences of the University of Padova in collaboration with EMBL and Venice International University, funded in part by the European Research Council (ERC) Grant CoG 101171662, and supported by EMBL’s Transversal Theme Theory@EMBL.

Bioconductor Maintainer Validation

Lori Shepherd-Kern — Tue, 16 Jun 2026 00:00:00 +0000

[This article was first published on Bioconductor community blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Introduction

Bioconductor policies include being an active and reachable maintainer. Maintainer emails in the DESCRIPTION of packages often go stale as maintainers change positions. There is also a necessity to have maintainers opt into Bioconductor policies and procedures as they change over time.

We have created an application that uses Amazon Simple Email Service (SES) to email maintainers periodically, both to check that the address is reachable and, once a year, to request a verified opt-in to Bioconductor’s current policies, procedures, and code of conduct.

Initial feedback is that this email is “spammy” and may be marked as such by institutions, but it is an initial attempt at compliance. We will look at alternatives to emails like specialized maintainer account access at a future date.

Access to Information

The information is in a publicly accessible database. We do not recommend connecting directly to the webservice but instead using the accompanying Bioconductor R package BiocMaintainerApp. It provides a Shiny application interface for querying Bioconductor package maintainers’ information.

Thank you

We appreciate maintainers’ cooperation moving forward.

New Package Submission Process

Lori Shepherd-Kern — Mon, 15 Jun 2026 00:00:00 +0000

[This article was first published on Bioconductor community blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Introduction

Bioconductor is moving towards using R-universe for its daily build system. See our previous blog post Collaborating between Bioconductor and R-universe on Development of Common Infrastructure. As we move in this direction it was also necessary to update the submission process for Bioconductor packages. While the daily builders are still transitioning, the new submission process location is now live. The new system utilizes GitHub Actions to trigger review milestones and R-Universe as the build/check backend. The new system provides a smoother experience; it is more automated and avoids administrative steps that have historically bottlenecked the review process.

Information

Location:

The new location for submitting new packages to Bioconductor for review is BiocContributions. This replaces the old location at Bioconductor/Contributions.

Documentation

There is documentation on What to Expect as well as a detailed Slide Deck.

There is also a FAQ for commonly asked questions, concerns, or troubleshooting.

If you need to report an issue with the new system, please open an Issue on the BiocSubmissionProcess GitHub repository.

What about Bioconductor/Contributions

The submission location at Bioconductor/Contributions has been frozen and will no longer accept new issues. BiocContributions replaces this location. If you already submitted to the old location and have been assigned a reviewer, your review will finish there. If you have not been assigned a reviewer yet, we will be posting shortly to close out your submission and move it to the new location.

Easier Reproducibility

One of the most frequent questions we receive is how to reproduce the results of the build reports Bioconductor creates. The switch to using R-universe as the building and checking backend allows for a reproducible testing environment. Any maintainer can apply R-Universe checking on their personal GitHub repository for a Bioconductor package by following these instructions. This allows a maintainer to test a package before submitting it to Bioconductor, and to test any future changes before pushing them directly to Bioconductor.

Thank you!

We appreciate your patience and understanding as we transition to the new system.

Why we still need {admiral} in an age of AI

Jeff Dickinson — Sun, 14 Jun 2026 00:00:00 +0000

[This article was first published on pharmaverse blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

There is a version of the AI-in-pharma story that goes like this: LLMs are trained on vast amounts of R code, so they can write ADaM programs on demand. Packages like {admiral} become optional — a style preference rather than a requirement. Just describe what you need and let the model figure it out.

The benchmark data from pharma-skills tells a different story.

What an Unskilled Agent Actually Does

When an AI coding agent is asked to derive an ADAE dataset without access to {admiral} skill guidance, it does not reach for derive_var_trtemfl() or derive_vars_merged() with the correct parameters. Across multiple benchmark runs, unskilled agents fell into two consistent failure modes: either generating synthetic data rather than using pharmaverse reference datasets, or writing bespoke dplyr pipelines that reimplemented logic {admiral} already provides — incorrectly.

One example from BDS benchmarking is particularly telling. Without skill guidance, agents consistently used derive_vars_merged() where derive_vars_merged_lookup() was required for parameter code assignment. Both functions exist in {admiral}. Both execute without error. But derive_vars_merged() drops unmatched records silently, producing a dataset with the wrong row count. No warning. No crash. Just wrong output that passes a casual review.

This is not a model quality problem. It is a knowledge problem. The model does not know what the pharmaverse community knows.
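
The general failure pattern, a merge that quietly changes the row count, can be illustrated without {admiral} at all. The sketch below uses base R merge() on hypothetical toy data; it does not reproduce the internals of either {admiral} function, it only shows why a row-count assertion after any merge is a cheap guard.

```r
# Hypothetical toy data: three records, one with no match in the lookup table.
records <- data.frame(
  USUBJID = c("01", "02", "03"),
  TESTCD  = c("SYSBP", "DIABP", "UNKNOWN")
)
lookup <- data.frame(
  TESTCD  = c("SYSBP", "DIABP"),
  PARAMCD = c("SYSBP", "DIABP")
)

# An inner join drops the unmatched record: no warning, no error.
inner <- merge(records, lookup, by = "TESTCD")

# A left join keeps every input record and exposes the gap as NA.
left <- merge(records, lookup, by = "TESTCD", all.x = TRUE)

# Guard: the merged data must have as many rows as the input.
stopifnot(nrow(left) == nrow(records))

# Records that still need a parameter code assigned.
unmatched <- left[is.na(left$PARAMCD), ]
unmatched
```

Running the same stopifnot() against the inner-join result would fail immediately, which is the kind of loud failure a casual review would otherwise miss.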

The Package as Specification

{admiral} is more than a collection of R functions. It is a community-maintained encoding of CDISC ADaM logic — accumulated through years of collaboration across sponsors, CROs, and regulators, tested against real submissions, and versioned for traceability. When a programmer calls derive_vars_dtm() with the correct imputation flags, they are not just writing R code. They are implementing a specification that has been reviewed, validated, and documented.

An LLM trained on general R code does not reliably inherit that specification. It has seen {admiral} in its training data, but not with the depth or precision needed to apply it correctly across the full range of ADaM derivation scenarios. The unskilled agent that wrote a custom parse_dtc_datetime() function using substr() and as.POSIXct() — rather than calling derive_vars_dtm() — was not being lazy. It was doing its best with what it knew. Its best was not good enough, and the errors it introduced were in the edge cases that matter most in a clinical submission.
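
The article does not say which edge cases tripped up the hand-written parser. One plausible candidate, assumed here purely for illustration, is partial dates: a fixed-format as.POSIXct() call returns NA for anything that is not a complete datetime, with no warning. The input values below are hypothetical.

```r
# Hypothetical ISO 8601 character values: a complete datetime,
# a date without a time, and two partial dates.
dtc <- c("2023-05-14T10:30", "2023-05-14", "2023-05", "2023")

# A naive fixed-format parse, similar in spirit to a hand-rolled helper.
naive <- as.POSIXct(dtc, format = "%Y-%m-%dT%H:%M", tz = "UTC")

# Only the first value survives; the rest become NA silently.
data.frame(dtc = dtc, parsed = naive, lost = is.na(naive))
```

A validated imputation routine has to decide explicitly what to do with each of those incomplete values, and that decision is what a bespoke parser tends to leave out.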

What the Skill Does

The {admiral} skills in pharma-skills do not replace {admiral}. They connect the AI agent to it. A skill provides curated, domain-aware guidance: which functions to use for which derivations, how to structure the program for QC readability, which variables require special handling, and what assertions to include. The skill is the bridge between a capable general-purpose model and the specific, validated logic the pharmaverse community has built.

The benchmark results reflect this directly. Across ADSL, ADAE, ADVS, and ADLB:

With skill: 88–100% pass rates across domains
Without skill: 17–59% pass rates, with high variance

That variance in the unskilled condition matters as much as the mean. Inconsistent output is not a defensible process in a GxP context. A skill-guided agent produces consistent, traceable, {admiral}-anchored code. An unskilled agent produces something different every time.

The Accountability Anchor

There is a regulatory dimension here that goes beyond code quality. A clinical submission needs to trace its derivations to validated, versioned, documented methods. A bespoke LLM-generated pipeline, however functional, has no such anchor. {admiral} provides it. When a submission uses derive_var_trtemfl() from a pinned version of {admiral}, the derivation logic is documented, community-reviewed, and reproducible. The AI is most useful when it is writing code that inherits those properties, not when it is improvising around them.

This is why the pharma-skills project frames skills not as prompt templates, but as domain knowledge artifacts. The goal is not to make AI write more R code. It is to make AI write {admiral} code — correctly, consistently, and in a form that a human reviewer can audit and a regulatory submission can defend.

{admiral} was built for exactly this moment. The community just needs to make sure AI knows how to use it.

Last updated

2026-06-14 18:52:36.863336

Reuse

CC BY 4.0

Citation

BibTeX citation:

@online{dickinson2026,
  author = {Dickinson, Jeff},
  title = {Why We Still Need \{Admiral\} in an Age of {AI}},
  date = {2026-06-14},
  url = {https://pharmaverse.github.io/blog/posts/2026-06-14_admiral_in_age_of_ai/admiral_in_age_of_ai.html},
  langid = {en}
}

For attribution, please cite this work as:

Dickinson, Jeff. 2026. “Why We Still Need {Admiral} in an Age of AI.” June 14, 2026. https://pharmaverse.github.io/blog/posts/2026-06-14_admiral_in_age_of_ai/admiral_in_age_of_ai.html.

Seasonal adjustment by @ellis2013nz

free range statistics - R — Sat, 13 Jun 2026 13:00:00 +0000

[This article was first published on free range statistics - R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

A reasonably straightforward post today. I wanted to look at monthly tourism numbers in Samoa. In fact I started to do this for Pacific islands in general, but the data wrangling challenges were sufficient that I only got as far as Samoa for now. There’s interest in these at the moment because they would be a relatively timely indicator of possible economic damage from the fuel crisis related to the Iran war. Visitor numbers, inflation, and merchandise trade are part of the very select number of monthly published economic statistics in this part of the world (for a select subset of countries).

There’s two things happening in this post:

getting hold of Samoa’s visitor arrivals numbers; and
seasonally adjusting them.

The latter is more interesting to me than the former, but unfortunately the former took most of the time.

Data wrangling

Here’s what the visitor numbers look like, once we’ve got them all into one data frame:

The Samoa Bureau of Statistics has a nice Excel workbook up to May 2023, but from that date onwards the data is only available as far as I can see in PDF reports. Luckily these are all available as links from a single page, but there seems to be either a skill issue on my part or some kind of block on the website that stops any systematic download of them all, so I had to download all the PDFs by hand one at a time.

Then Claude helped me write a parser to find the visitor arrivals number in each PDF. Actually, Claude wasn’t any good at actually finding the right number, but it did give me a pattern I could adapt, much as in the old days I would have used Stack Overflow. There are a lot of numbers in each PDF and we need the right one—visitor arrivals, not total arrivals (yes, that’s an em dash, but I write them all by hand). In this case it turns out that the trick is that the PDFs all include the sentence “Overall visitor numbers for the month under review stood at [number]”, and luckily there is no other use of the word “stood” in the document.

The other fiddly thing was extracting the actual date each PDF referred to. Then everything needed to be tested. In the end, it would have been quicker to just manually type the 35 numbers I needed. But here’s the code that does all the data wrangling.

library(tidyverse)
library(rvest)
library(httr2)
library(lubridate)
library(pdftools)   
library(readxl)
library(seasonal)
library(tsibble)
library(fable)
library(feasts)
library(ggtext)
library(scales)

#' Extract date from messy filenames
extract_date <- function(messy_date){
  tibble(path = messy_date) |>
  mutate(
    stem      = tools::file_path_sans_ext(basename(path)),
    # Remove any leading word followed by _ or - (e.g. "Migration_")
    stem      = str_remove(stem, "^([A-Za-z]+[_-])+(?=[A-Z])"),
    month_str = str_extract(stem, "[A-Za-z]{3,}"),
    # Normalise non-standard abbreviations
    month_str = str_replace(month_str, "^Sept$", "Sep"),
    year_str  = str_extract(stem, "\\d{4}|\\d{2}"),
    year      = if_else(nchar(year_str) == 2,
                        as.integer(paste0("20", year_str)),
                        as.integer(year_str)),
    date      = as.Date(paste(year, month_str, "01"), format = "%Y %B %d") |>
                  coalesce(as.Date(paste(year, month_str, "01"), format = "%Y %b %d"))
  ) |>
  pull(date)
}

test <- c("samoa_pdfs/April_25.pdf", "samoa_pdfs/Feb_25.pdf", "samoa_pdfs/Feb_26.pdf", 
"samoa_pdfs/Jan_26.pdf", "samoa_pdfs/January_25.pdf", "samoa_pdfs/July_25.pdf", 
"samoa_pdfs/June-25.pdf", "samoa_pdfs/March_2025.pdf", "samoa_pdfs/March_2026.pdf", 
"samoa_pdfs/May_25.pdf", "samoa_pdfs/Migration_April-2026.pdf",
"samoa_pdfs/Sept-24.pdf", "samoa_pdfs/Migration_Rep_June_2023.pdf")

extract_date(test)

#-------------------PDFs for recent data------------------
# For the more recent years no Excel tables are published, so need
# to use the PDFs and extract total from there
# These had to be downloaded by hand - nothing I tried was able to automate
# that. Download from https://www.sbs.gov.ws/migration/ and save
# in a subfolder /samoa_pdfs/.

pdf_dir <- "samoa_pdfs"
# Escape the dot so only files that really end in ".pdf" are matched
tbl <- tibble(local_path = list.files(pdf_dir, pattern = "\\.pdf$", full.names = TRUE))

parse_pdf_visitors <- function(path) {
  txt <- tryCatch(pdf_text(path), error = function(e) {
    message("  [WARN] pdftools could not read: ", basename(path))
    NULL
  })
  if (is.null(txt)) return(NA_integer_)
  full_text <- paste(txt, collapse = "\n")
  patterns <- c(
    "stood at [^0-9(]{0,10}\\(?([0-9,]+)\\)?"
  )
  for (pat in patterns) {
    m <- str_match(full_text, pat)[, 2]
    if (!is.na(m)) return(as.integer(str_remove_all(m, ",")))
  }
  message("  [WARN] Could not parse visitor count from: ", basename(path))
  NA_integer_
}

found <- tbl |> 
  filter(file.exists(local_path))

message("  Parsing ", nrow(found), " local PDFs...")
pdf_tbl <- found |>
  mutate(visitors = map_int(local_path, \(p) {
    message("  ", basename(p))
    parse_pdf_visitors(p)
  })) |>
  filter(!is.na(visitors)) |>
  mutate(date = extract_date(local_path))

#-----------------Excel versions for older data------------
# For May 2023 and earlier we can get the data for multiple months
# at a time from Table 1 of the Excel tables. The May 2023 Excel
# file goes back to 2017 January (although the rows are hidden)

fn <- "May_23.xlsx"
if(!file.exists(fn)){
  download.file("https://sbs.gov.ws/images/sbs-documents/social/Arrival/2023/May_23.xlsx",
                destfile = fn, mode = "wb")
}
x <- read_excel(fn, sheet = "Table 1", 
                 range = "D48:D130",
                 col_names = "visitors") |> 
  drop_na() |> 
  pull(visitors)

historical <- tibble(visitors = x, 
               date = seq(as.Date("2017-01-01"), as.Date("2023-05-01"), by = "month"))

#-----------combine and test-----------------------
samoa_visitors <- pdf_tbl |> 
  select(date, visitors) |>
  bind_rows(historical) |> 
  arrange(date) |> 
  mutate(date_month = yearmonth(date)) |> 
  as_tsibble(index = date_month)


# Test - some hand picked test cases, 4 from PDFs and 3 from the Excel
samoa_test <- tribble(~date, ~correct_visitors,
                      "2023-08-01", 16471,
                      "2024-04-01", 12644,
                      "2024-08-01", 17248,
                      "2026-04-01", 14188,
                      "2018-02-01", 7413,
                      "2020-12-01", 195,
                      "2022-06-01", 866) |> 
  mutate(date = as.Date(date))

stopifnot(
  samoa_test |> 
    anti_join(samoa_visitors, by = c("date", "correct_visitors" = "visitors")) |> 
    nrow() == 0
)
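
The "stood at" pattern inside parse_pdf_visitors() can be checked on its own before pointing it at real PDFs. This is a small sketch: the sample sentence is modelled on the wording quoted earlier, and the number in it is made up.

```r
library(stringr)

# Same pattern as in parse_pdf_visitors()
pat <- "stood at [^0-9(]{0,10}\\(?([0-9,]+)\\)?"

# Hypothetical sentence; the figure is invented for illustration.
sample_text <- paste(
  "Overall visitor numbers for the month under review stood at 12,345,",
  "an increase on the same month last year."
)

# Column 2 of str_match() holds the first capture group (digits and commas).
captured <- str_match(sample_text, pat)[, 2]
visitors <- as.integer(str_remove_all(captured, ","))
visitors

# When the sentence is absent the capture is NA, which is what
# triggers the "[WARN] Could not parse" branch in the function.
no_match <- str_match("No headline figure in this text.", pat)[, 2]
is.na(no_match)
```

Because the capture group is greedy, a comma directly after the number is captured too, but stripping all commas before as.integer() makes that harmless.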

And here’s the code that draws the basic time series chart I used earlier:

the_caption = "Source: Samoa Bureau of Statistics"

ggplot(samoa_visitors, aes(x = date, y = visitors)) +
  geom_line() +
  scale_y_continuous(labels = comma) +
  labs(x = "",
       y = "Visitor arrivals",
       title = "Visitor arrivals per month to Samoa",
       subtitle = "Unadjusted originals",
       caption = the_caption)

Modelling seasonal decomposition

Phew. Ok, on to the more fun part of actually modelling. I intend to use the X-13ARIMA-SEATS tool, a program developed and maintained by the US Census Bureau for seasonal adjustment and time-series decomposition. It will automatically fit a SARIMA (seasonal autoregressive integrated moving average) time series model, has built-in methods for identifying and dealing with outliers, by default adjusts for the number of trading days and the moving Easter holiday, and allows the user to specify additional regression explanatory variables. It’s the go-to across the world for seasonal adjustment of official statistics.

X-13ARIMA-SEATS is available in R through the seasonal package (by Christoph Sax and Dirk Eddelbuettel). Downstream of that, the feasts and fable packages (by Mitchell O’Hara-Wild, Rob Hyndman and Earo Wang) make it easier to work with in a tabular, tidyverse approach.

The Covid period is an obvious dominating factor in tourism anywhere in recent decades, and shows up in the first chart I showed above. I’m also interested in the period since the USA-Israel-Iran war began to see if there is an impact from that. There’s only two months of data (March and April 2026) since the war started, so an impact would have to be dramatic to show up, but it’s worth checking.

X-13ARIMA-SEATS by default will fit forecasts when you ask it to model so you need to provide values of x regressor variables to cover not only the period of the data but a few periods out ahead. I’m doing this as simple time series vectors—I haven’t yet worked out the easy way to do this in the tabular world of fable. Luckily, it seems I can use these vectors later even in fable. In a moment I’ll use both seasonal directly and via fable to make sure I get the same results. In fact, updating my dated knowledge of time series modelling to use fable was one of my main motivations for this whole exercise.

Here’s the code that makes the x regressors I’ll be using for the presence of the Covid pandemic and for the Iran war:

#----------------x regressor variables-----------------

# Covid time series indicator to use as a regressor
covid_reg <- ts(
  as.numeric(seq(as.Date("2017-01-01"), as.Date("2029-04-01"), by = "month") %in%
    seq(as.Date("2020-04-01"), as.Date("2022-07-01"), by = "month")),
  start     = c(2017, 1),
  frequency = 12
)

# Iran war time series indicator
war_reg <- ts(
  as.numeric(seq(as.Date("2017-01-01"), as.Date("2029-04-01"), by = "month") %in%
    seq(as.Date("2026-03-01"), as.Date("2026-07-01"), by = "month")),
  start     = c(2017, 1),
  frequency = 12
)
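
Since the two indicators differ only in their "on" period, the construction can also be wrapped in a small helper and sanity-checked. This is an optional sketch; month_dummy() is a hypothetical name, not part of any package, and the default span simply copies the dates used above.

```r
# Hypothetical helper: monthly 0/1 indicator that is 1 between on_from and on_to.
month_dummy <- function(on_from, on_to,
                        span_from = "2017-01-01", span_to = "2029-04-01") {
  span <- seq(as.Date(span_from), as.Date(span_to), by = "month")
  on   <- seq(as.Date(on_from), as.Date(on_to), by = "month")
  ts(as.numeric(span %in% on),
     start     = c(as.integer(format(span[1], "%Y")),
                   as.integer(format(span[1], "%m"))),
     frequency = 12)
}

covid_chk <- month_dummy("2020-04-01", "2022-07-01")
war_chk   <- month_dummy("2026-03-01", "2026-07-01")

# Sanity checks: number of "on" months and total length of the series.
sum(covid_chk)     # April 2020 to July 2022 inclusive
sum(war_chk)       # March 2026 to July 2026 inclusive
length(covid_chk)  # January 2017 to April 2029 inclusive

# The switch-on point should fall between March and April 2020.
window(covid_chk, start = c(2020, 3), end = c(2020, 4))
```

Checking the switch-on month explicitly is worthwhile because an off-by-one in a dummy regressor will not raise an error; it just shifts the estimated effect.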

Directly with `seasonal`

Ok, it’s modelling time. First, using just an old fashioned time series vector of Samoa’s visitor arrivals and the seasonal package directly, here’s fitting the X-13ARIMA-SEATS model with defaults using both the Covid and Iran war regressors:

sa_ts <- ts(samoa_visitors$visitors, frequency = 12, start = c(2017, 1))

fit_ts_war <- seas(sa_ts, xreg = cbind(covid_reg, war_reg))
summary(fit_ts_war)

Super simple. That gets us this output:

Call:
seas(x = sa_ts, xreg = cbind(covid_reg, war_reg))

Coefficients:
                  Estimate Std. Error z value Pr(>|z|)    
xreg1             -3.10069    0.14567 -21.286  < 2e-16 ***
xreg2             -0.15937    0.13098  -1.217    0.224    
LS2020.Mar        -0.85036    0.14567  -5.838 5.29e-09 ***
AO2020.Dec        -0.75529    0.12350  -6.115 9.63e-10 ***
LS2021.May        -0.79012    0.13233  -5.971 2.36e-09 ***
AO2021.Jul        -1.36973    0.12378 -11.066  < 2e-16 ***
LS2022.May         0.89068    0.13233   6.731 1.68e-11 ***
LS2022.Aug        -1.07689    0.19608  -5.492 3.97e-08 ***
AR-Nonseasonal-01 -0.59006    0.07041  -8.380  < 2e-16 ***
MA-Seasonal-12     0.99961    0.08044  12.427  < 2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

SEATS adj.  ARIMA: (1 1 0)(0 1 1)  Obs.: 112  Transform: log
AICc:  1608, BIC:  1634  QS (no seasonality in final):3.295  
Box-Ljung (no autocorr.): 26.42   Shapiro (normality): 0.9721 *

Some key things to note here.

The data was log-transformed, which is good—I would certainly have chosen to do this, or at least a square root transform, given the way visitor arrivals variance increases as its mean does, all around the world.

The model eventually adopted is described as ARIMA (1 1 0)(0 1 1). This means the main series has an autoregression term on one lag, after one round of differencing; and the seasonal part has a moving average term on one lag, after one round of differencing. This is a very normal model for tourism numbers. It indicates a general trend/drive (the differencing in the main series), a tendency for the values in one month to be related one way or another to those in the month before (in this case with a negative correlation of -0.59) and a strong annual seasonality effect that changes slowly over time.
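
Written out as an equation, the fitted model is roughly the following. This assumes the Box-Jenkins sign convention, in which moving average terms enter with a minus sign; that convention is worth confirming against the X-13ARIMA-SEATS manual before reading too much into the signs.

```latex
\[
(1 - \phi B)\,(1 - B)\,(1 - B^{12})\,\bigl(\log y_t - x_t^{\top}\beta\bigr)
  = (1 - \Theta B^{12})\,\varepsilon_t
\]
```

Here $y_t$ is monthly visitor arrivals, $B$ is the backshift operator ($B y_t = y_{t-1}$), $x_t$ holds the regressors and outlier terms with coefficients $\beta$, $\phi \approx -0.59$ is the non-seasonal AR coefficient, $\Theta \approx 1.00$ is the seasonal MA coefficient, and $\varepsilon_t$ is white noise.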

The Covid dummy variable (xreg1) is strongly negatively significant. With a coefficient of -3.1 and the log transform of the response variable, this means that during the Covid period the actual arrivals were on average exp(-3.1) = 0.045 (ie 4.5%, or down 95.5%) of what they were during the non-Covid periods.
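The back-transformation from the log scale is easy to check by hand. This small sketch just redoes that arithmetic with the xreg1 estimate copied from the summary above; the variable names are mine.

```r
# Coefficient for the Covid regressor, copied from the summary output
covid_coef <- -3.10069

# Because the response was log-transformed, the effect is a multiplier
covid_multiplier <- exp(covid_coef)

round(covid_multiplier, 3)              # 0.045
round(100 * (1 - covid_multiplier), 1)  # 95.5 (percent reduction)
```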

In contrast, we don’t have a statistically significant effect for the Iran war (xreg2). For subsequent analysis I will take out that x regressor as I don’t want it complicating the recent trend and seasonally adjusted figures.

Six months’ data have been singled out as outliers and controlled for appropriately. These are all in the difficult-to-model Covid period of 2020 to 2022.
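As a rough illustration of the two outlier types in the output, an additive outlier (AO) touches a single month while a level shift (LS) is a step that persists. The sketch below builds such dummies by hand for a 112-month series starting January 2017. The exact coding X-13ARIMA-SEATS uses internally may differ (for example by a constant), so treat this as the general idea only.

```r
# 112 monthly dates starting January 2017, matching the length of the series above
months <- seq(as.Date("2017-01-01"), by = "month", length.out = 112)

# Additive outlier (AO): switched on for one month only
ao_2020_dec <- as.numeric(months == as.Date("2020-12-01"))

# Level shift (LS): a step that stays switched on from that month onwards
ls_2020_mar <- as.numeric(months >= as.Date("2020-03-01"))

sum(ao_2020_dec)  # 1 month affected
sum(ls_2020_mar)  # every month from March 2020 to the end of the series

# On the log scale, the LS2020.Mar coefficient of -0.85 is a multiplier of about 0.43
round(exp(-0.85036), 2)
```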

One point to note is that we don’t have an Easter effect. I am 100% sure there is really an Easter effect in Samoa’s visitor arrivals, but 9 years of data isn’t enough to show it. Easter is sometimes in April and sometimes in March. But since 2017, it happens to have been in April every year apart from 2024. That’s just not enough variation to distinguish it from regular monthly seasonal impacts.
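The claim about Easter dates can be checked without any packages. This sketch uses the anonymous Gregorian algorithm (also known as the Meeus/Jones/Butcher algorithm) for Western Easter Sunday; the function name easter_sunday() is my own and is not part of seasonal or X-13ARIMA-SEATS.

```r
# Month and day of Western Easter Sunday for one or more years
easter_sunday <- function(year) {
  a <- year %% 19
  b <- year %/% 100
  cc <- year %% 100
  d <- b %/% 4
  e <- b %% 4
  f <- (b + 8) %/% 25
  g <- (b - f + 1) %/% 3
  h <- (19 * a + b - d - g + 15) %% 30
  i <- cc %/% 4
  k <- cc %% 4
  l <- (32 + 2 * e + 2 * i - h - k) %% 7
  m <- (a + 11 * h + 22 * l) %/% 451
  n <- h + l - 7 * m + 114
  data.frame(year = year, month = n %/% 31, day = n %% 31 + 1)
}

# The years covered by the Samoa series
easter_dates <- easter_sunday(2017:2026)
easter_dates

# Years in the sample where Easter Sunday fell in March
easter_dates$year[easter_dates$month == 3]  # 2024
```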

To check my recollection that Easter is indeed checked for by default in X-13ARIMA-SEATS, I fit a model to the well known Box and Jenkins airline data:

> # Another example for comparison
> m <- seas(AirPassengers)
> summary(m)

Call:
seas(x = AirPassengers)

Coefficients:
                    Estimate Std. Error z value Pr(>|z|)    
Weekday           -0.0029497  0.0005232  -5.638 1.72e-08 ***
Easter[1]          0.0177674  0.0071580   2.482   0.0131 *  
AO1951.May         0.1001558  0.0204387   4.900 9.57e-07 ***
MA-Nonseasonal-01  0.1156204  0.0858588   1.347   0.1781    
MA-Seasonal-12     0.4973600  0.0774677   6.420 1.36e-10 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

SEATS adj.  ARIMA: (0 1 1)(0 1 1)  Obs.: 144  Transform: log
AICc: 947.3, BIC: 963.9  QS (no seasonality in final):    0  
Box-Ljung (no autocorr.): 26.65   Shapiro (normality): 0.9908

Here we see that the number of week days in a month and the moving Easter holiday are indeed in the final model, without me having to have asked for them to be checked. Easter has a positive impact on this air passengers series (1949 to 1960), and the number of week days in a month has a smaller negative impact.

So after that Easter-checking digression, I refit the model for my Samoa visitor arrivals without the war regressor and get an essentially identical result:

> fit_ts <- seas(sa_ts, xreg = covid_reg)
> summary(fit_ts)

Call:
seas(x = sa_ts, xreg = covid_reg)

Coefficients:
                  Estimate Std. Error z value Pr(>|z|)    
xreg              -3.10045    0.14690 -21.106  < 2e-16 ***
LS2020.Mar        -0.83292    0.14617  -5.698 1.21e-08 ***
AO2020.Dec        -0.75463    0.12449  -6.062 1.35e-09 ***
LS2021.May        -0.78975    0.13356  -5.913 3.36e-09 ***
AO2021.Jul        -1.36737    0.12476 -10.960  < 2e-16 ***
LS2022.May         0.89113    0.13356   6.672 2.52e-11 ***
LS2022.Aug        -1.07733    0.19782  -5.446 5.15e-08 ***
AR-Nonseasonal-01 -0.58577    0.07071  -8.284  < 2e-16 ***
MA-Seasonal-12     0.99912    0.07827  12.765  < 2e-16 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

SEATS adj.  ARIMA: (1 1 0)(0 1 1)  Obs.: 112  Transform: log
AICc:  1607, BIC:  1631  QS (no seasonality in final):2.501  
Box-Ljung (no autocorr.): 26.63   Shapiro (normality): 0.9706 *

With `fable` and `feasts`

tsibble, fable and feasts are a brilliant set of packages that let you work with time series in R in a more tabular and tidyverse-friendly way than the various older time series data structures let you. Most of my work with time series was before they were available, though, so I’m lacking confidence in how they work. Luckily it seems to be pretty straightforward.

I’d already done the critical step earlier with as_tsibble(index = date_month), specifying my main samoa_visitors tibble is actually a time series tibble. Now the modelling using that tsibble is pretty simple:

#----------fable version-------
fit_fb <- samoa_visitors |> 
  model(X_13ARIMA_SEATS(
    visitors ~ xreg(covid_reg)
  ))

report(fit_fb)

Note we use report() rather than summary() to get the end result. I’m not going to print it here because it is literally identical to what we got earlier with summary(fit_ts).

The main appeal for me in using the fable/feasts approach is that it fits better with both my data wrangling workflow and my approach to ggplot2 graphics. So here is a nice decomposition of the original time series, produced with autoplot():

fit_fb |> 
  components() |> 
  autoplot()

Note that in this decomposition, the original ‘visitors’ series and the trend are on the original scale, but the ‘seasonal’ and ‘irregular’ components are expressed as multipliers. So a seasonal value of 1.2 means in a given month the value is 20% higher as a result of the seasonality than otherwise.
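A toy calculation shows how the multipliers fit together. The numbers below are made up, and the sketch ignores regression effects such as the Covid dummy; it only illustrates what "expressed as multipliers" means.

```r
# Made-up numbers, purely to illustrate a multiplicative decomposition
trend     <- 10000  # underlying level, on the original scale
seasonal  <- 1.2    # this month typically runs 20% above the underlying level
irregular <- 0.98   # leftover noise, also a multiplier

# The observed value is the product of the three components
observed <- trend * seasonal * irregular

# Seasonal adjustment divides the seasonal multiplier back out
season_adjust <- observed / seasonal

observed       # 11760
season_adjust  # 9800
```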

And here is my final, presentation version of the data:

Produced with this code:

comp_data <- fit_fb |> 
  components() |> 
  # the 'trend' that comes straight from the decomposition does
  # not adjust for the Covid coefficient and looks pretty weird
  # so it is more intuitive to present it after adjustment for
  # Covid, which we need to calculate by multiplying (because of
  # the log transform that SEATS used automatically):
  mutate(covid = as.numeric(covid_reg)[1:nrow(samoa_visitors)],
         trend_adj = trend * exp(covid * coef(fit_ts)[1])) 

comp_data |> 
  ggplot(aes(x = date_month, y = season_adjust)) +
  geom_line(linewidth = 1.3) +
  geom_line(aes(y = trend_adj), colour = "steelblue", alpha = 0.9, linewidth = 1.2) + 
  geom_point(aes(y = visitors), colour = "grey70") +
  scale_y_continuous(label = comma) +
  labs(x = "",
       y ="Visitor arrivals",
       title = "Visitor arrivals per month to Samoa",
       subtitle = "Original, seasonally adjusted and trend (adjusted for Covid period).",
       caption = "Source: data from Samoa Bureau of Statistics. Seasonal adjustment by freerangestats.info.") +
  theme(plot.subtitle = element_markdown())

After all that, what insight do we have? Well, not a lot really, other than the blindingly obvious trends of the devastation to the industry of Covid and the slow and noisy growth trend since then. We are at least well placed to make commentary on the impact of the fuel crisis on tourism, and can say that none is evident yet. If and when we do see the impact, we’ll be able to talk about it in terms of trends and random variation, after having removed the seasonal element. So that’s useful.

Well, that’s all. Perhaps I’ll get around in a later post to adding the other Pacific island countries with monthly tourism data—Fiji, Vanuatu, Cook Islands, French Polynesia being the key ones I’m aware of.

To leave a comment for the author, please follow the link and comment on their blog: free range statistics - R.

Continue reading: Seasonal adjustment by @ellis2013nz

New CRAN Packages: signal or noise?

Joseph Rickert — Fri, 12 Jun 2026 00:00:00 +0000

[This article was first published on R Works, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

If you are reading this post on R-bloggers, you will probably know that I have been publishing my selection of the “Top 40” new R packages on CRAN for quite some time. I did this first as part of my work at Revolution Analytics, then on R Views for RStudio and Posit, and now here on R Works. It used to take about a day’s worth of pleasurable work spread out over a month to select forty interesting packages. For a hundred or so packages, I could look at all of the package webpages, download and play with a small number of them. Now, the “Top 40” has become a real hamster-on-the-wheel project. The following plot shows my count of the number of new packages to make it to CRAN since I began publishing on R Works.

Test Doubles Taxonomy for R: Dummy, Stub, Spy, Mock, Fake

Jakub Sobolewski — Fri, 12 Jun 2026 00:00:00 +0000

[This article was first published on Jakub Sobolewski, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

You might call them all “mock”.

Mock the database. Mock the API. Mock the function. The word becomes a catch-all for any test double, any object you substitute for a real dependency in a test. Lumping them together makes it harder to choose the right tool, and the wrong choice leads to brittle, misleading tests.

There are five distinct types, each with a specific job. Knowing which is which is how you stop writing tests that do the wrong thing.

The code under test

All five examples use a single function: process_payment. It charges a card, logs the attempt, and optionally notifies the customer.

process_payment <- function(order, payment_gateway, logger, notifier = NULL) {
  logger$log(paste("Processing order", order$id))

  result <- payment_gateway$charge(order$amount, order$card_token)

  if (!result$success) stop("Payment failed: ", result$error)

  if (!is.null(notifier)) {
    notifier$send(order$customer_id, result$transaction_id)
  }

  result$transaction_id
}

It has three dependencies: payment_gateway, logger, and notifier. Each one will be replaced with a different kind of double depending on what we’re trying to test.

1. Dummy

Definition: an object passed to satisfy a required parameter but never actually used by the test.

process_payment always calls logger$log. The logger is required. But for a test that’s only checking whether the correct transaction ID is returned, we don’t care what gets logged. We just need something that won’t blow up when called.

test_that("returns the transaction ID on successful payment", {
  # Arrange
  order <- list(
    id = "ord-1",
    amount = 100,
    card_token = "tok_visa",
    customer_id = "cust-42"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-abc")
    }
  )

  # Act
  result <- process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger
  )

  # Assert
  expect_equal(result, "txn-abc")
})
Test passed with 1 success 🥇.

dummy_logger accepts any call and does nothing. The test doesn’t assert on it at all. Its only job is to satisfy the function signature.

A dummy should be the simplest thing that compiles. Recording calls or setting expectations would make it something else. If you find yourself writing a dummy that crashes, or does something unexpected when called, the code path you’re testing actually does use the dependency.

Worth knowing.

2. Stub

Definition: a replacement that returns pre-programmed responses, used to control what the code under test receives.

A stub lets you put the system in a specific state without involving real infrastructure. If you want to test what process_payment does when a card is declined, you don’t need a real payment API. You just return the response you want.

test_that("throws an error when payment is declined", {
  # Arrange
  order <- list(
    id = "ord-2",
    amount = 200,
    card_token = "tok_declined",
    customer_id = "cust-7"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = FALSE, error = "insufficient funds")
    }
  )

  # Act & Assert
  expect_error(
    process_payment(
      order,
      payment_gateway = stub_gateway,
      logger = dummy_logger
    ),
    "insufficient funds"
  )
})
Test passed with 1 success 🥇.

The stub provides inputs to the system under test. You assert on what the code did with those inputs (in this case, that it threw the right error).

Notice that process_payment accepts payment_gateway as an argument. That’s dependency injection: the function doesn’t create or import its own gateway, so the test can pass in anything with the same interface. Without it you’d need a patching library to intercept the real dependency mid-call. With it, a plain list with a charge function is enough. Stubs work best when the code is designed this way: dependencies accepted as arguments, not hardwired inside.

If you practice test-first development, you’ll notice that you use this pattern all the time. You can’t write the test without it. You don’t know what to patch in a function that doesn’t exist yet! It’s only natural to inject all dependencies as you write the interface of your code.

When the dependency isn’t declared in the interface, when the function calls another function directly by name, mockery::stub() can patch it for the duration of a test:

# A function that calls charge_card() internally, with no way to inject it
process_payment_legacy <- function(order) {
  result <- charge_card(order$amount, order$card_token)
  if (!result$success) {
    stop("Payment failed: ", result$error)
  }
  result$transaction_id
}

charge_card <- function(amount, token) {
  stop("would call real payment API")
}

test_that("returns transaction ID when charge succeeds", {
  # Arrange
  order <- list(amount = 100, card_token = "tok_visa")
  mockery::stub(
    process_payment_legacy,
    "charge_card",
    function(amount, token) {
      list(success = TRUE, transaction_id = "txn-stub")
    }
  )

  # Act
  result <- process_payment_legacy(order)

  # Assert
  expect_equal(result, "txn-stub")
})
Test passed with 1 success 🥳.

mockery::stub() replaces charge_card inside the scope of process_payment_legacy for that one test call, without touching the real function anywhere else.

mockery::stub() has a catch. The stub is targeted by function name as a string, so if you rename charge_card, the stub silently stops applying and the test runs against the real function with no warning. The test is also coupled to an implementation detail: if you refactor process_payment_legacy to call payment_gateway$charge() instead, the stub breaks even if the behavior is unchanged. That’s the Overspecification smell.

Use mockery::stub() when you’re working with legacy code that wasn’t built with testability in mind and you can’t refactor the interface right now. It lets you get tests in place quickly. Treat it as a stepping stone: once the characterization tests are green, refactor toward dependency injection and replace the patch with a plain stub passed as an argument.

To sum up: when you need to control what a dependency returns and don’t care how it was called, reach for a stub.

3. Spy

Definition: a stub that also records calls made to it, so you can assert on them afterward.

Sometimes the behavior you’re testing is a side effect. A notification that should have been sent, a message that should have been logged. The code doesn’t return a value you can assert on. It calls something. A spy captures those calls.

make_notifier_spy <- function() {
  calls <- list()
  list(
    send = function(customer_id, transaction_id) {
      calls[[length(calls) + 1]] <<- list(
        customer_id    = customer_id,
        transaction_id = transaction_id
      )
    },
    calls = function() calls
  )
}
test_that("notifies the customer after successful payment", {
  # Arrange
  order <- list(
    id = "ord-3",
    amount = 50,
    card_token = "tok_visa",
    customer_id = "cust-99"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-xyz")
    }
  )
  spy_notifier <- make_notifier_spy()

  # Act
  process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger,
    notifier = spy_notifier
  )

  # Assert
  expect_length(spy_notifier$calls(), 1)
  expect_equal(spy_notifier$calls()[[1]]$customer_id, "cust-99")
  expect_equal(spy_notifier$calls()[[1]]$transaction_id, "txn-xyz")
})
Test passed with 3 successes 🌈.

The spy is a stub with memory. You call the code, then interrogate the spy to see what happened.

You don’t always need to build a spy by hand. mockery::mock() also collects calls, so it can serve as a spy when you want the recording behaviour without writing the closure yourself:

test_that("notifies the customer after successful payment (mockery spy)", {
  # Arrange
  order <- list(
    id = "ord-3b",
    amount = 50,
    card_token = "tok_visa",
    customer_id = "cust-99"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-xyz")
    }
  )
  spy_send <- mockery::mock()

  # Act
  process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger,
    notifier = list(send = spy_send)
  )

  # Assert
  mockery::expect_called(spy_send, 1)
  expect_equal(mockery::mock_args(spy_send)[[1]][[1]], "cust-99")
  expect_equal(mockery::mock_args(spy_send)[[1]][[2]], "txn-xyz")
})
Test passed with 3 successes 😀.

The handwritten version is clearer when you want the recording mechanism visible to readers, useful in a codebase where not everyone knows mockery. mockery::mock() is more concise once the team is familiar with the library.

The difference from a mock comes down to return values. A spy records calls and nothing else. A mock records calls and can also return pre-programmed values, which makes it useful when you need the dependency to behave a specific way and want to assert on how it was used.

4. Mock

Definition: “a double pre-programmed with expectations that form a specification of the calls it should receive. A true mock can throw if it receives a call it doesn’t expect, and is checked during verification to confirm it got all the calls it was expecting.” [1, Fowler]

mockery::mock() is looser than that definition. It accepts any call without complaining and doesn’t enforce expectations upfront. It records every call it receives (the arguments, the order, the count) and returns pre-programmed values you supply. Verification is your responsibility in the Assert step.

test_that("sends exactly one notification with correct arguments", {
  # Arrange
  order <- list(
    id = "ord-4",
    amount = 75,
    card_token = "tok_visa",
    customer_id = "cust-11"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-def")
    }
  )
  mock_notifier <- list(send = mockery::mock())

  # Act
  process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger,
    notifier = mock_notifier
  )

  # Assert
  mockery::expect_called(mock_notifier$send, 1)
  mockery::expect_args(mock_notifier$send, 1, "cust-11", "txn-def")
})
Test passed with 5 successes 🥳.

Use a mock when the interaction itself is what you’re testing: whether the code called the dependency in the right way, with the right arguments.

Mocks are also the easiest double to overuse. Assert on every call to every dependency and you’ve written an overspecified test, one that breaks whenever the implementation changes even when the behavior stays the same.

Prefer a spy when you only need to record calls. A plain list with a function that appends to a vector is often enough. Reach for a mock when you also need to control what the dependency returns. The risk is the same with any interaction-based assertion: check every call to every dependency and you end up with a test that mirrors the implementation rather than the behaviour, breaking whenever the internals change even when the outcome doesn’t.
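As a sketch of the "plain list with a function that appends to a vector" idea, here is a hand-rolled spy for the logger. The helper make_logger_spy() and the order values are hypothetical; process_payment() is copied from the top of the post so the block runs on its own, and the check uses base stopifnot() so it does not need testthat.

```r
# process_payment() copied from the top of the post so the sketch is self-contained
process_payment <- function(order, payment_gateway, logger, notifier = NULL) {
  logger$log(paste("Processing order", order$id))
  result <- payment_gateway$charge(order$amount, order$card_token)
  if (!result$success) stop("Payment failed: ", result$error)
  if (!is.null(notifier)) {
    notifier$send(order$customer_id, result$transaction_id)
  }
  result$transaction_id
}

# Hypothetical helper: a spy that appends each logged message to a character vector
make_logger_spy <- function() {
  messages <- character()
  list(
    log = function(msg) {
      messages <<- c(messages, msg)
      invisible(NULL)
    },
    messages = function() messages
  )
}

spy_logger <- make_logger_spy()
stub_gateway <- list(
  charge = function(amount, token) list(success = TRUE, transaction_id = "txn-log")
)
order <- list(id = "ord-6", amount = 10, card_token = "tok_visa", customer_id = "cust-1")

txn_id <- process_payment(order, payment_gateway = stub_gateway, logger = spy_logger)

# The spy recorded exactly one message, built from the order id
stopifnot(identical(spy_logger$messages(), "Processing order ord-6"))
```

Nothing here controls return values or sets expectations up front; the spy only remembers what it was told, which is all this assertion needs.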

5. Fake

Definition: a working implementation that’s simpler than the real thing, suitable for tests but not production.

A fake isn’t just a pre-programmed response. It has real behavior. An in-memory database is a fake: it stores and retrieves data like the real thing, just without persistence or network overhead. It behaves correctly across multiple calls, which a stub can’t do.

make_fake_payment_gateway <- function() {
  transactions <- list()

  list(
    charge = function(amount, token) {
      if (amount <= 0) {
        return(list(success = FALSE, error = "invalid amount"))
      }
      if (token == "tok_declined") {
        return(list(success = FALSE, error = "card declined"))
      }

      id <- paste0("txn-", length(transactions) + 1)
      transactions[[id]] <<- list(
        amount = amount,
        token = token
      )
      list(success = TRUE, transaction_id = id)
    },
    find = function(transaction_id) {
      transactions[[transaction_id]]
    }
  )
}
test_that("successful charges are recorded in the gateway", {
  # Arrange
  order <- list(
    id = "ord-5",
    amount = 120,
    card_token = "tok_visa",
    customer_id = "cust-3"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  fake_gateway <- make_fake_payment_gateway()

  # Act
  txn_id <- process_payment(
    order,
    payment_gateway = fake_gateway,
    logger = dummy_logger
  )

  # Assert
  recorded <- fake_gateway$find(txn_id)
  expect_equal(recorded$amount, 120)
  expect_equal(recorded$token, "tok_visa")
})
Test passed with 2 successes 🎊.

Fakes work well when you need to test behaviour across multiple operations: place an order, query its status, refund it. A stub would need to be reprogrammed for each call. A fake just handles it.
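To illustrate the multi-operation case, here is a sketch of a fake that supports a charge, a status query and a refund in sequence. The status() and refund() methods and the function name are assumptions added for illustration; they are not part of the gateway interface used elsewhere in this post.

```r
# Hypothetical fake: keeps transactions in memory and tracks their status
make_fake_gateway_with_refunds <- function() {
  transactions <- list()

  list(
    charge = function(amount, token) {
      if (amount <= 0) {
        return(list(success = FALSE, error = "invalid amount"))
      }
      id <- paste0("txn-", length(transactions) + 1)
      transactions[[id]] <<- list(amount = amount, token = token, status = "charged")
      list(success = TRUE, transaction_id = id)
    },
    status = function(transaction_id) {
      txn <- transactions[[transaction_id]]
      if (is.null(txn)) NA_character_ else txn$status
    },
    refund = function(transaction_id) {
      if (is.null(transactions[[transaction_id]])) {
        return(list(success = FALSE, error = "unknown transaction"))
      }
      transactions[[transaction_id]]$status <<- "refunded"
      list(success = TRUE)
    }
  )
}

# Charge, query, refund, query again: state survives across calls
gateway <- make_fake_gateway_with_refunds()
charged <- gateway$charge(120, "tok_visa")
status_before <- gateway$status(charged$transaction_id)
refund_result <- gateway$refund(charged$transaction_id)
status_after <- gateway$status(charged$transaction_id)

stopifnot(status_before == "charged", status_after == "refunded")
```

A stub would need a different canned response wired in for each of those four calls; the fake answers all of them from the same in-memory state.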

They’re also a good fit for acceptance tests and manual inspection. An acceptance test exercises a full user-facing behaviour end-to-end, several layers of the application working together. At that level you don’t want stubs reprogrammed for individual calls; you want a dependency that behaves realistically across the whole flow. A fake payment gateway, a fake email sender, a fake file store: these let your acceptance test suite run in CI without connecting to external services, needing credentials, or leaving side effects behind. You can also wire the same fakes into a development mode of the app. Spin up the Shiny app pointing at the in-memory gateway and you can click through every payment scenario without touching a real API.

The cost is that fakes take time to build and maintain. They need to be kept in sync with the real interface they’re replacing. For a small, stable interface that’s used heavily across your test suite and in manual workflows, the investment pays off. For a dependency you only use in one unit test, a stub is simpler.

When to reach for each one

Double	Has behaviour	When to use
Dummy	No	Fill a required parameter you won’t touch
Stub	Pre-programmed only	Control what the code receives
Spy	Pre-programmed only	Assert on side effects after the fact
Mock (`mockery`)	Pre-programmed only	Assert on calls and control what the code receives
Mock (eager, hand-written)	Pre-programmed only	Pin an exact interaction as a hard contract
Fake	Yes	Replace stateful or multi-call dependencies

The key difference is between stub and mock. A stub returns values. You assert on the outcome. A mock records calls and can return pre-programmed values. Using a mock where a stub would do couples your test to implementation details. Using a stub where a mock is needed means missing the interaction you were trying to verify.

When in doubt: if you’re asserting on a return value or a state change, use a stub. If you’re asserting that a specific call was made, use a spy or a mock. If the dependency has real state that needs to survive across calls, build a fake.

Appendix: implementing an eager mock by hand

mockery::mock() is sufficient for everyday use. Skip this if you’re not curious about a mock that fails during execution of the code under test.

This is what a mock matching Fowler’s definition looks like in R. It takes a list of expected calls in the Arrange step, fails immediately on anything unexpected, and exposes a verify() function to confirm every expected call was made.

make_mock_notifier <- function(expected_calls) {
  received <- list()

  list(
    send = function(customer_id, transaction_id) {
      call <- list(
        customer_id = customer_id,
        transaction_id = transaction_id
      )
      match <- any(sapply(expected_calls, identical, call))
      if (!match) {
        testthat::fail(sprintf(
          "Unexpected call: send('%s', '%s')",
          customer_id,
          transaction_id
        ))
      }
      received[[length(received) + 1]] <<- call
    },
    verify = function() {
      for (exp in expected_calls) {
        found <- any(sapply(received, identical, exp))
        if (!found) {
          testthat::fail(sprintf(
            "Expected call never made: send('%s', '%s')",
            exp$customer_id, exp$transaction_id
          ))
        }
      }
      testthat::succeed()
    }
  )
}

The mock rejects unexpected calls on the spot:

test_that("throws immediately when called with unexpected arguments", {
  # Arrange
  order <- list(
    id = "ord-4b",
    amount = 75,
    card_token = "tok_visa",
    customer_id = "cust-11"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-def")
    }
  )
  mock_notifier <- make_mock_notifier(
    expected_calls = list(list(
      customer_id = "cust-WRONG",
      transaction_id = "txn-def"
    ))
  )

  # Act — throws before we even reach Assert
  process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger,
    notifier = mock_notifier
  )
})
── Failure: throws immediately when called with unexpected arguments ───────────
Unexpected call: send('cust-11', 'txn-def')
Backtrace:
    ▆
 1. └─global process_payment(...)
 2.   └─notifier$send(order$customer_id, result$transaction_id)

Error:
! Test failed with 1 failure and 0 successes.

And verify() catches expected calls that were never made:

test_that("fails verification when an expected call was never made", {
  # Arrange
  order <- list(id = "ord-4c", amount = 75, card_token = "tok_visa", customer_id = "cust-11")
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(charge = function(amount, token) list(success = TRUE, transaction_id = "txn-def"))
  mock_notifier <- make_mock_notifier(
    expected_calls = list(
      list(customer_id = "cust-11", transaction_id = "txn-def"),
      list(customer_id = "cust-99", transaction_id = "txn-xyz")  # will never be called
    )
  )

  # Act
  process_payment(order, payment_gateway = stub_gateway, logger = dummy_logger, notifier = mock_notifier)

  # Assert
  mock_notifier$verify()
})
── Failure: fails verification when an expected call was never made ────────────
Expected call never made: send('cust-99', 'txn-xyz')

Error:
! Test failed with 1 failure and 1 success.

The happy path passes both checks:

test_that("passes when all expected calls are made and no unexpected ones occur", {
  # Arrange
  order <- list(
    id = "ord-4d",
    amount = 75,
    card_token = "tok_visa",
    customer_id = "cust-11"
  )
  dummy_logger <- list(log = function(...) invisible(NULL))
  stub_gateway <- list(
    charge = function(amount, token) {
      list(success = TRUE, transaction_id = "txn-def")
    }
  )
  mock_notifier <- make_mock_notifier(
    expected_calls = list(list(
      customer_id = "cust-11",
      transaction_id = "txn-def"
    ))
  )

  # Act
  process_payment(
    order,
    payment_gateway = stub_gateway,
    logger = dummy_logger,
    notifier = mock_notifier
  )

  # Assert
  mock_notifier$verify()
})
Test passed with 1 success 🥇.

References

Martin Fowler — TestDouble
Gerard Meszaros — Test Double Patterns

To leave a comment for the author, please follow the link and comment on their blog: Jakub Sobolewski.

Continue reading: Test Doubles Taxonomy for R: Dummy, Stub, Spy, Mock, Fake

armadillo4r 1.0.0 is on CRAN

https://pacha.dev/blog — Thu, 11 Jun 2026 23:00:00 +0000

[This article was first published on https://pacha.dev/blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

armadillo 1.0.0 is on CRAN

Tags: C++, Armadillo, Linear algebra

armadillo 1.0.0 brings enhanced sparse matrix support, reduced dependencies, and comprehensive cross-platform testing.

Author: Mauricio “Pachá” Vargas S.

Published: June 12, 2026

I’m pleased to announce that armadillo 1.0.0 is now available on CRAN. This release brings substantial improvements to performance, reduced dependencies, and widely tested cross-platform compatibility.

Key Improvements

The 1.0.0 release brings several major enhancements:

Enhanced sparse matrix support: The package now offers seamless interoperability with R’s Matrix package, providing a more robust “translation” between R and Armadillo sparse matrices.
Reduced dependencies: All unit tests have been migrated from testthat to the lightweight tinytest suite, simplifying the dependency footprint.
Upgraded cpp4r dependency: The underlying cpp4r library has been refined to reduce dependencies while conditionally leveraging newer C++ features where available (C++23 on modern platforms).
Comprehensive testing: The package has been validated across multiple platforms using R-Hub images for different C++ compilers and operating systems, complemented by GitHub Actions testing on macOS and Windows.

For more information, visit the CRAN package page or explore the 500+ examples.

If you liked this post, please consider donating to support my Open Source work: https://buymeacoffee.com/pacha.

To leave a comment for the author, please follow the link and comment on their blog: https://pacha.dev/blog.

Continue reading: armadillo4r 1.0.0 is on CRAN

Ronny Hernandez Mora, Joel Nitta, and Nick Tierney Join rOpenSci Software Peer Review Editorial Team

rOpenSci — Thu, 11 Jun 2026 00:00:00 +0000

[This article was first published on rOpenSci - open tools for open science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

What makes rOpenSci’s software peer review work is people who care deeply about the quality and usability of scientific software, and who give their time and expertise to help others build it better. Today, we’re pleased to announce three new members of our editorial team.

We welcome Joel H. Nitta and Nicholas Tierney as new editors, and formally introduce Ronny Hernández Mora, who joined the editorial team in August 2025. Each brings a distinct perspective shaped by their work, their communities, and their experience with open-source R development. Together, they strengthen our capacity to serve the growing number of package authors who submit their work for review, and to uphold the collaborative, friendly and transparent standards that rOpenSci software peer review is known for.

Ronny Hernández Mora

Ronny is a PhD student at the University of Alberta and a research software developer with a background in data analysis and remote sensing. His current research explores how perception-driven drone systems can support detection, monitoring, and decision making in real world settings.

Before starting his PhD, Ronny worked as a data developer at ixpantia, developing data tools, automation pipelines, APIs, and production analytical applications for organizations across Latin America and the United States. Alongside his PhD, he works with Openscapes and contributes to open-source communities through teaching, mentorship, talks, and collaborative software projects.

Ronny on GitHub, Website.

I first heard about rOpenSci through a Data Latam podcast episode, and from there I started following the organization and its work. Over time, I got to know people connected to the community, but it was not until 2024 that I became directly involved by volunteering as a package reviewer.

I have always liked the idea of building software collaboratively: creating tools that can be understood, reviewed, reused, and improved by others. That is one of the things I value most about rOpenSci: the way it combines technical review with openness, care, and constructive feedback. I am grateful for the opportunity to contribute as an editor and to support authors and reviewers through that process and continue learning through this community effort.

Ronny

Joel H. Nitta

Joel is currently an associate professor at Chiba University, Japan, where he studies the evolution and ecology of ferns. Throughout his career as a botanist, he has also cultivated a keen interest in reproducible data analysis and has authored several R packages. Two of these are currently part of rOpenSci: canaper for spatial phylogenetic analysis, and dwctaxon for validation and maintenance of taxonomic databases. He is also an official maintainer of two rOpenSci packages: rgnparser for parsing taxonomic names, and restez for querying the GenBank DNA database. Outside of rOpenSci, Joel is active in the Bio"Pack"athon community in Japan, serves on the organizing committee of the Pteridophyte Phylogeny Group, and is a certified instructor for The Carpentries. Joel’s hobbies include various forms of being active outside (hiking, running, backpacking, botanizing, cycling), playing the euphonium, and tabletop games.

Joel on GitHub, Website.

rOpenSci has been incredibly important in my journey with data science and R. Above all, the extremely knowledgeable and helpful rOpenSci community has provided invaluable support as I transitioned from “R package user” to “R package developer”. While it may be possible to do this on your own, it is much more enjoyable and efficient when you are in the company of like-minded people. I especially appreciate this because I did not study computer science in undergrad and I am largely self-taught when it comes to R, having first learned it out of necessity during my graduate studies. I am very excited to officially join the editorial team and have the opportunity to give back to the organization that has supported me so much.

Joel

Nicholas Tierney

Nicholas (Nick) Tierney is a statistician, Research Software Engineer, and freelance consultant with a PhD in Statistics who specialises in data analytics, R package development, and teaching. Previously, he worked with Professor Nick Golding at The Kids Research Institute Australia and was a Research Fellow at Monash University with Professor Dianne Cook, where he developed tools for exploratory data analysis including visdat, naniar, and brolgar. Nick actively writes about R-related projects at his blog, “credibly curious”. When not coding, Nick enjoys outdoor adventures and hiked the entire Pacific Crest Trail in 2023, documenting his journey at njt.micro.blog.

Nick on GitHub, Website.

I remember seeing rOpenSci online in 2014, at their first rOpenSci hackathon in San Francisco, USA. I was inspired by not just the projects they worked on, but the collective of people who were kind and generous as much as they were brilliant. I helped run an offshoot of the rOpenSci Unconf in Australia, which ran in 2016, 2017, 2018, and 2019. I received amazing mentorship and support from Karthik Ram and Stefanie Butland in learning how to run these unconferences. I think these have had a lasting impact on connecting people from Australia and New Zealand.

I had my R package, visdat, reviewed by rOpenSci in 2017, and this experience was formative to my understanding of software review, and greatly improved my own day-to-day practice. In 2024, together with Eric Scott and Andrew Brown, I submitted the geotargets package to rOpenSci.

rOpenSci has always been at the forefront of building a community of practice, and their model of peer review is a gold standard. Science would be better if more journals practised such transparency and kindness. It is an enormous honour to be able to give back to a community that has been so formative in my career and my life.

Nick

About the Software Peer Review Program

rOpenSci’s software peer review program brings together volunteers to collaboratively review scientific and statistical software according to transparent, constructive, and open standards. We have two different types of review: one for general research software packages and another for packages that implement statistical methods. Editors like Ronny, Joel, and Nick are central to that process: they handle initial submission checks, identify and coordinate reviewers, and guide authors through the review until their package is ready.

Get Involved

Thinking about submitting a package? Start here:

rOpenSci Software Peer Review: scope, process, and guidelines;
rOpenSci Packages: Development, Maintenance, and Peer Review: the full guide for package authors;
rOpenSci Statistical Software Peer Review: the full guide for statistical software submissions;
Public software review threads on GitHub: see software peer review in action.

Would you like to contribute as a reviewer? We’d love to have you. Fill out the rOpenSci Reviewer Sign-Up Form and we’ll match you with packages that fit your expertise.

A warm welcome to Ronny, Joel, and Nick! We’re very glad to have you with us.

To leave a comment for the author, please follow the link and comment on their blog: rOpenSci - open tools for open science.

Eleven Latin American Voices for Open Science: The New Cohort of Champions rOpenSci 2026

rOpenSci — Tue, 09 Jun 2026 00:00:00 +0000

[This article was first published on rOpenSci - open tools for open science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

We are very happy to introduce the new rOpenSci Champions. This group will experience the program and work in Spanish, allowing us to continue to strengthen the open science and research software development community in this language. We are excited about the projects they will develop, which address real challenges from different disciplines and territories in Latin America.

We invite you to meet each of these people and the projects they will be working on throughout the program.

Bastián Olea Herrera

Bastián Olea Herrera
Undersecretary of Regional and Administrative Development
Government of Chile

My name is Bastián, I live in Chile, I am a sociologist by training and I have a Master’s degree in sociology. I like to create content and tutorials about R, and recently we have been organizing a community of R users in Santiago, where I live. I am dedicated to data analysis in the public sector, where I work mainly with social data at different territorial levels. This kind of data always needs the same types of cleaning and corrections, but at the same time it usually varies a lot in small details from source to source. That is why I applied to the program to develop a package that facilitates working with data at the communal and regional level in Chile. I hope to meet other R users and learn together, as well as receive support from people with more experience and knowledge, so that we can create useful tools for many people!

Denisse Fierro Arcos

Denisse Fierro Arcos
Institute for Marine and Antarctic Studies
University of Tasmania

My name is Denisse Fierro Arcos, I am a marine scientist originally from Ecuador but currently living in Australia. At the moment I am close to finishing my PhD program which focuses on developing best practices for the use of oceanographic models in marine ecosystem studies. I am also currently working as a researcher at the University of Tasmania where I am developing a marine ecosystem model that will allow us to project the possible impacts of climate change on marine species and fisheries. It is precisely this model that is the focus of my project with the Champions Program. We want to publish a package in R that will allow other marine researchers to easily run our model.

Durga Valentina Linares Herrera

Durga Valentina Linares Herrera
Research Center of the Universidad del Pacífico (CIUP)

Hello! My name is Durga Valentina Linares Herrera and I live in Lima, Peru. I am a social scientist, with a degree in Political Science from the Universidad Antonio Ruiz de Montoya and a diploma in Data Science for Social Sciences and Public Management from the Pontificia Universidad Católica del Perú. I work as a research assistant at the Research Center of the Universidad del Pacífico, where I explore the intersection between technology and work from a sociological perspective and with mixed methodologies. My current projects revolve around the Peruvian labor market and its exposure to artificial intelligence, and the evolution of the strike as a social phenomenon in Peru in the last three decades.

For the program, I proposed {epen} as my project: an R package to download, process, and analyze microdata from the Permanent National Employment Survey and its predecessor, the Lima EPE. This idea was born from my experience working directly with these databases. I could see how difficult it can be to access public labor data in a fast, clear, and reproducible way, especially for those of us who come from the social sciences and do not always come to these tools with a previous technical background, as was also my case. With this package, I seek to facilitate that path for researchers, students, and public sector analysts who need to build reliable labor indicators without having to start from scratch each time, something I find especially relevant in this time of so many changes.

In recent years, with the intention of developing new skills in a context where technology is quickly advancing, I moved towards a more quantitative approach and learned to program in a self-taught way with free resources on the internet. That process has been important for my professional development, but it also made me want to turn that learning into something useful for other people: a tool to help reduce the entry barriers that I myself encountered when I started. That’s how the motivation behind this package came about, and this Champions Program appeared just at the right time. I am excited to participate in an initiative like this and to be part of a network like rOpenSci, whose commitment to a more diverse and inclusive open science seems very valuable to me. I hope that this experience will allow me to consolidate in community a path that I once started on my own and pave the way for future packages aimed at improving access to Peruvian public data in R.

Evelia Lorena Coss Navarrete

Evelia Lorena Coss Navarrete
LIIGH-UNAM

I am a postdoctoral researcher specialized in transcriptomics and single cell data analysis. My research focuses on the study of transcriptomic profiles of Mexican patients with lupus, with the aim of better understanding the biological mechanisms of the disease and providing reproducible bioinformatics tools that strengthen both biomedical research and academic training in Mexico and Latin America.

I am a Biotechnology Engineer from the Polytechnic University of Sinaloa (UPSIN), originally from Mazatlan, and I did my Master’s and PhD in Plant Biotechnology at Cinvestav, where I studied the conservation of lncRNAs in plants.

My participation in the rOpenSci Champions Program seeks to strengthen communities such as VieRnes de Bioinformatics, R-Ladies Morelia and RSG-Mexico, promoting the creation of R packages with international standards, reproducible documentation and open review. My vision is to bridge the gap between the global rOpenSci community and local initiatives in Latin America, promoting a more open, inclusive and sustainable science.

Gladys Choque Ulloa

Gladys Choque Ulloa
University of São Paulo
Founder of Women in DataLab

Hello! My name is Gladys Choque Ulloa, I am originally from Peru and currently reside in Brazil. I have a degree in Statistics, a Master’s in Statistics, and I am currently pursuing my PhD in Computer Science at the ICMC-USP of the University of São Paulo, Brazil. There, I am developing research in computational neuroscience focused on the automatic diagnosis of mental disorders, using Machine Learning models, Neural Networks, LLMs, Causal Inference, and Time Series. In addition to my academic work, I am founder of the organization Women in DataLab, where I work to reduce the gender gap in technology and data science.

I applied to the rOpenSci Champions Program because I strongly believe in the power of open science and reproducible software to democratize knowledge. During the program, my goal is to hone my skills in developing R tools and scientific software under global standards. With this experience I want to strengthen my technical profile and act as a bridge for more researchers in Latin America to adopt collaborative and open practices, enhancing the impact of our scientific community internationally.

José Daniel Conejeros Pavez

José Daniel Conejeros Pavez
Lagrange Fellow, ISI Foundation
Early Career Researcher, SENTINET – UC

I am José Daniel Conejeros, MSc in Statistics from the Pontificia Universidad Católica de Chile. I am originally from Chile and currently develop my work between Chile and Italy. In Italy I work as a Lagrange Fellow at the ISI Foundation and as a young researcher at the SENTINET Center (Surveillance, epidemiology and new technologies for emerging infectious threats), working on issues of data science, complex systems, and public health.

My work is situated at the intersection of statistics, epidemiology, and computational social science to understand infectious disease dynamics and health outcomes for different populations. Currently, I am developing spatmask, an R package oriented to the masking and anonymization of spatial data (geomasking), with the goal of enabling reproducible analyses without compromising the privacy of individuals. This project aims to bridge the gap between the use of sensitive data for research and the ethical and regulatory restrictions that limit its access and use.

I decided to apply to the Champions Program because many of the challenges I face are not only technical, but also organizational and cultural. I want to understand how to develop scientific software collaboratively, how to document it correctly and how to foster reproducible practices in contexts where open science is not yet fully installed. I am interested in learning how to build tools that not only work, but that are understandable, auditable, and useful for research teams and Public Policy decision makers.

Through this program, I hope to strengthen my skills in scientific software development, open review and collaborative work. My goal is to translate this learning into concrete transfer: training, collaboration, and community building around open science in the region.

Linda Cabrera Orellana

Linda Cabrera Orellana
R-Ladies Ecuador

Hello! My name is Linda, I am Ecuadorian and currently reside in Granada, Spain. I work as a Senior Data Analyst in a digital marketing agency and I share my experience as a teacher in business schools, where I teach classes on AI applied to analytics and data visualization.

My project consists of developing an R package with an educational approach to detect, explain, and help correct common structural errors in manually created datasets, specially designed for people with no technical background in data. I am applying to the program because I want to turn my practical and teaching experience into a real contribution to the open scientific software ecosystem, and to strengthen my skills in R package design, documentation, and maintenance.

I also hope that the project will serve as an educational resource in universities in Ecuador and in the R-Ladies community, and that it will build a bridge between those who collect data and those who analyze it, promoting data literacy in Spanish.

María Florencia Tames

María Florencia Tames
National University of Córdoba
Argentina

My name is María Florencia Tames, I am a professor at the National University of Córdoba (Argentina) and I work in research in the area of air quality, exposure to air pollutants and environmental inequalities in urban contexts in Latin America.

With a team I developed the AirExposure R package, a tool for estimating daily exposure to air pollutants by integrating information on ambient concentrations, mobility, and daily routines. Through the rOpenSci Champions Program, my goal is to improve and consolidate this package as an open, reproducible and accessible tool for the community.

I am especially interested in strengthening my skills in open software development, incorporating best practices such as documentation, testing, and peer review, and learning to build tools that can be used beyond their original research context.

I am also motivated by the program’s focus on community and working in Spanish, as access to programming resources and training is often limited by language. I hope to be able to share what I have learned through training activities and contribute to the development of an open scientific software community in Latin America.

Marina Cecilia Cock

Marina Cecilia Cock
INCITAP (CONICET-UNLPam)
National University of La Pampa

I am from Santa Rosa, La Pampa, Argentina. I have a degree in Natural Resources and Environment Engineering from the National University of La Pampa (UNLPam) and a PhD in Agrarian Sciences with orientation in Ecology from the University of Buenos Aires (UBA).

I am currently working as a research assistant at the National Council of Scientific and Technical Research (CONICET) and as a teaching assistant in Biogeography and Statistics at the National University of La Pampa.

I applied to the program to learn about the R package review process. I am interested in participating because I find it especially valuable to join a community where learning is shared and collaborative. I am looking to strengthen my knowledge in R and then be able to transmit it both in my teaching role and within the R community.

Patricia Andrea Loto

Patricia Andrea Loto
FACENA, National University of the Northeast (UNNE)

I have a degree in Information Systems and a Diploma in Data Science, Machine Learning and its Applications. I am currently pursuing a Master’s Degree in Information Technology at the Universidad Nacional del Nordeste. I live and work in Argentina.

I work as a software developer and data analyst in the public sector, and as a university teacher in the area of systems and programming. I am a co-founder of RSE Argentina, member of the organizing committee of LatinR, and co-organizer of R-Ladies Resistencia-Corrientes. From these spaces I work to promote open science, reproducible research software and technology inclusion in Latin America.

In the 2026 cohort I am participating as a mentee with the guidance of Guadalupe Pascal. My project is an R package to systematize the creation of Software and Data Management Plans through standardized templates developed with Quarto, which facilitate the documentation, preservation, and reuse of data and scientific code under international open science standards.

I applied to the Champions Program because I see it as a bridge: between where I am technically and where I want to be. I hope to learn about the rOpenSci peer review process, deepen my understanding of testing, technical documentation, and everything that makes a package really useful to others. I’m also looking to connect with a network of developers and researchers who share open science values. Finally, I want research software developed from the Global South to have greater visibility and for our contributions to be recognized, and I think rOpenSci is the ideal platform for that.

Estefania Torrejón

Estefania Torrejón
NOVA Medical School. Lisbon, Portugal

I am Peruvian and a biology graduate of the Universidad Nacional Mayor de San Marcos (UNMSM), with a Master’s degree in Biomedical Sciences from the Institute of Hygiene and Tropical Medicine (IHMT) in Lisbon, Portugal. I currently reside in Portugal, where I work as a predoctoral researcher in bioinformatics at the Metabolic Diseases Research Lab of NOVA Medical School. I am also director of the International Relations Department of the Peruvian Society of Bioinformatics and Computational Biology.

The project I am developing in the framework of the rOpenSci Champions Program consists of the preparation and submission of my R package, EV-Net, to the CRAN repository. EV-Net is a bioinformatics tool that identifies and prioritizes molecules present in the cargo of extracellular vesicles (EVs) with high regulatory potential on a receptor tissue of interest. EVs are structures surrounded by a lipid bilayer that carry a great diversity of active molecules. These vesicles can move through the organism and reach specific tissues, which is why they are recognized as key mediators of cell-to-cell communication (CCC). However, most of the bioinformatics tools available to study CCC do not consider EV-mediated communication. To address this limitation, my collaborators, supervisors, and I developed EV-Net.

Being an rOpenSci Champion is an invaluable opportunity to receive the necessary training, coaching and mentoring to bring EV-Net up to the required quality standards and to be successfully incorporated into CRAN.

Next steps

With the presentation of this new group of Champions, we begin the fourth edition of the program, the second in Spanish. This group has already started the training stage and met their mentors. They will be working for 12 months developing new packages, preparing existing packages to submit to the peer review process, and reviewing other people’s packages.

If you want to follow the development of their projects and where and when their dissemination and communication activities will take place, don’t miss our blog articles, news in our newsletter and social networks.

To leave a comment for the author, please follow the link and comment on their blog: rOpenSci - open tools for open science.

Little useless-useful R functions – Ulam Prime Spiral

tomaztsql — Sun, 07 Jun 2026 17:00:17 +0000

[This article was first published on R – TomazTsql, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

In 1963, Stanislaw Ulam of Los Alamos was bored in a meeting, so he started doodling integers in a spiral and circled the primes. Diagonal lines appeared. He later showed it to Martin Gardner and, to Ulam's surprise, Gardner published the finding in Scientific American. We are still confused to this day.

Ulam prime spiral (or Ulam spiral for short) with dimensions of 150 x 150 and a total of 2547 prime numbers (11.2 % coverage).

But Ulam was not doodling some little houses or boats, or cars like a normal person. No. He wrote numbers in a spiral. Then circled all the prime numbers. Then stared at what he had created with the dawning horror of a man who has seen too much.

So where does the spiral come from? Start with 1 in the middle. Write the numbers in a spiral outward. Then highlight all the prime numbers. If you draw long enough, diagonal lines will appear.

The main function computes the starting position and the direction steps up front, so that the plot comes out symmetric.

ulam_prime_spiral <- function(
    n         = 51,    
    theme     = c("Cosmic","Blueish","Classy","Psycho"),
    show_nums = FALSE,   # print values (n ≤ 21 only)
    animate   = FALSE,       
    speed     = 1,          
    verbose   = TRUE
) {
  
  theme <- match.arg(theme)
  
  # n must be odd so the spiral has a unique centre cell
  if (n %% 2 == 0) { n <- n + 1L }
  if (n < 5) stop("Size must be >= 5.")
  total <- n^2
  mid   <- (n + 1L) / 2L            
  
  dr <- c( 0L, -1L,  0L,  1L)   # row deltas:  E  N  W  S
  dc <- c( 1L,  0L, -1L,  0L)   # col deltas:  E  N  W  S
  
  mat   <- matrix(0L, n, n)     
  ord_r <- integer(total)        
  ord_c <- integer(total)        
  
  r <- mid;  
  cc <- mid  
  
  mat[r, cc] <- 1L
  ord_r[1]   <- r
  ord_c[1]   <- cc
  
  d    <- 1L  # current direction index (1=E 2=N 3=W 4=S)
  step <- 1L  # current arm length
  num  <- 2L                     
  
  while (num <= total) {
    for (half in 1:2) {          # each arm length is used for two directions
      for (i in seq_len(step)) {
        r  <- r  + dr[d]
        cc <- cc + dc[d]
        mat[r, cc] <- num
        ord_r[num] <- r
        ord_c[num] <- cc
        num <- num + 1L
        if (num > total) break  
      }
      d <- (d %% 4L) + 1L       # turn left: E→N→W→S→E
      if (num > total) break    
    }
    step <- step + 1L            
  }

Next, we determine the primes:

# Sieve of Eratosthenes   
  is_prime    <- rep(TRUE, total)
  is_prime[1] <- FALSE
  p <- 2L
  while (p * p <= total) {
    if (is_prime[p])
      is_prime[seq.int(p * p, total, p)] <- FALSE
    p <- p + 1L
  }

  prime_mat <- matrix(is_prime[mat], n, n)
  n_primes  <- sum(prime_mat)
  density   <- 100 * n_primes / total
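
To see the two steps working together on something small enough to check by hand, here is a minimal, self-contained sketch. It is not the author's full function: `ulam_sketch` is a made-up name, plotting and themes are left out, and it only returns the number matrix and a logical prime mask.

```r
# Sketch only: `ulam_sketch` is an illustrative name, not the author's function.
ulam_sketch <- function(n = 5L) {
  stopifnot(n %% 2 == 1, n >= 3)      # odd n gives a unique centre cell
  total <- n^2
  mid   <- (n + 1) %/% 2
  dr <- c(0L, -1L, 0L, 1L)            # row deltas: E N W S
  dc <- c(1L, 0L, -1L, 0L)            # col deltas: E N W S

  # Walk the spiral: each arm length is used for two consecutive directions.
  mat <- matrix(0L, n, n)
  r <- mid; cc <- mid
  mat[r, cc] <- 1L
  d <- 1L; step <- 1L; num <- 2L
  while (num <= total) {
    for (half in 1:2) {
      for (i in seq_len(step)) {
        if (num > total) break        # stop before stepping off the grid
        r  <- r  + dr[d]
        cc <- cc + dc[d]
        mat[r, cc] <- num
        num <- num + 1L
      }
      d <- (d %% 4L) + 1L             # turn left
    }
    step <- step + 1L
  }

  # Sieve of Eratosthenes over 1..total
  is_prime <- rep(TRUE, total)
  is_prime[1] <- FALSE
  p <- 2L
  while (p * p <= total) {
    if (is_prime[p]) is_prime[seq.int(p * p, total, by = p)] <- FALSE
    p <- p + 1L
  }

  list(mat = mat, prime = matrix(is_prime[mat], n, n))
}

res <- ulam_sketch(5L)
print(res$mat)                        # the spiral, with 1 in the centre
which(res$prime, arr.ind = TRUE)      # row/column positions of the primes
```

For n = 5, the cells holding 19, 7, and 23 (rows 3 to 5, columns 1 to 3) fall on one diagonal, a small preview of the lines that show up in larger grids.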

There are four types of visuals to choose from (Classy, Psycho, Blueish and Cosmic), and if you create a grid of 21×21 or smaller, you can also have the numbers displayed. And of course, the diagonals are visible as well:

As always, the complete code is available on GitHub in the Useless_R_functions repository. The complete version of the code is here: https://github.com/tomaztk/Useless_R_functions/blob/main/functions/ulam_spiral.R

Check the repository for future updates!

Stay healthy and happy R-coding!

To leave a comment for the author, please follow the link and comment on their blog: R – TomazTsql.

Learning Amino Acids Part 1: Non-Polar Amino Acids, Rodrigues Rotation, and Lennard-Jones Potential

r on Everyday Is A School Day — Sun, 07 Jun 2026 00:00:00 +0000

[This article was first published on r on Everyday Is A School Day, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Back to basics! Learning non-polar amino acids, what zwitterions actually are, and dipping into the applied math — Rodrigues rotation and Lennard-Jones potential. Slowly building toward optimal phi/psi!

Motivations

We’ve explored quite a bit lately: molecular dynamics simulation and, most recently, protein-protein docking. There is still so much to learn. I’ve decided to go back to basics, revisiting our old friends the amino acids and trying to understand the natural properties behind each one, hoping that will make things click later when we explore further. While making notes for myself on all the amino acids, I’ll also try to understand some of the basic math behind the structures. Are you ready!? Lol, I’m not, but let’s go anyway!

Objectives:

Amino Acids

Amino acids are the building blocks of proteins, each sharing a common backbone: a central α-carbon bonded to an amino group (–NH₂), a carboxyl group (–COOH), a hydrogen atom, and a variable side chain (R group) that defines each amino acid’s identity and chemistry.

Non-polar amino acids

Non-polar amino acids have hydrophobic side chains — they avoid water and tend to cluster in the interior of folded proteins, forming the hydrophobic core that drives protein stability. Understanding each one’s shape and bulk is directly relevant to how they pack, how they constrain backbone flexibility, and how substitutions affect enzyme active sites.

library(tibble)
library(kableExtra)

aa_nonpolar <- tribble(
  ~aa,  ~aa3,  ~name,           ~functional_group,  ~smiles_sidechain,    ~charge_ph7, ~mw_da, ~pka,     ~md_note,                                                                                ~main_function,
  "G",  "Gly", "Glycine",       "H (none)",         "[H]",                "Neutral",   75.03,  NA_real_, "Minimal VDW radius; unrestricted phi/psi; near-zero excluded volume",                  "Conformational flexibility; tight turns; active site geometry",
  "A",  "Ala", "Alanine",       "Methyl",           "C",                  "Neutral",   89.09,  NA_real_, "Low steric perturbation; high alpha-helix propensity in force fields",                 "Helix former; hydrophobic core; alanine-scanning mutagenesis",
  "V",  "Val", "Valine",        "Isopropyl",        "CC(C)",              "Neutral",   117.15, NA_real_, "Beta-branching restricts psi; favors extended beta-sheet; large gamma-carbons",        "Beta-sheet core; hydrophobic packing; sickle-cell HbS Glu6Val",
  "L",  "Leu", "Leucine",       "Isobutyl",         "CCC(C)C",            "Neutral",   131.17, NA_real_, "Flexible chi2; common rotamers at -65/-65 and -65/175; high hydrophobic SASA",         "Hydrophobic core; leucine zippers; most abundant non-polar in proteomes",
  "I",  "Ile", "Isoleucine",    "sec-Butyl",        "CCC(C)",             "Neutral",   131.17, NA_real_, "Beta-branching + gamma-branch; most restricted chi1/chi2; large buried SASA",          "Hydrophobic core; beta-barrel interiors; transmembrane helices",
  "P",  "Pro", "Proline",       "Pyrrolidine ring", "C1CCNC1",             "Neutral",   115.13, NA_real_, "Fixed phi ~-60; no backbone NH donor; cis/trans isomerism at Xaa-Pro bond",            "Helix breaker; beta-turns; collagen Gly-Pro-X repeats",
  "F",  "Phe", "Phenylalanine", "Benzyl",           "Cc1ccccc1",          "Neutral",   165.19, NA_real_, "Rigid aromatic ring; pi-pi stacking and cation-pi in MD energy decomposition",         "Hydrophobic core; aromatic clusters; ligand binding pockets",
  "W",  "Trp", "Tryptophan",    "Indolylmethyl",    "Cc1c[nH]c2ccccc12", "Neutral",   204.23, NA_real_, "Indole NH can H-bond; amphipathic at membrane interface; strong 280nm absorbance",     "Membrane anchoring; fluorescence probe; ligand binding; rarest standard AA",
  "M",  "Met", "Methionine",    "Thioether",        "CCSC",               "Neutral",   149.20, NA_real_, "Flexible sulfur geometry; oxidizable to sulfoxide in long MD runs; check reactive FF", "Translation initiation; hydrophobic core; redox sensing"
)

aa_nonpolar |>
  dplyr::select(aa:mw_da) |>
  kbl()

aa	aa3	name	functional_group	smiles_sidechain	charge_ph7	mw_da
G	Gly	Glycine	H (none)	[H]	Neutral	75.03
A	Ala	Alanine	Methyl	C	Neutral	89.09
V	Val	Valine	Isopropyl	CC(C)	Neutral	117.15
L	Leu	Leucine	Isobutyl	CCC(C)C	Neutral	131.17
I	Ile	Isoleucine	sec-Butyl	CCC(C)	Neutral	131.17
P	Pro	Proline	Pyrrolidine ring	C1CCNC1	Neutral	115.13
F	Phe	Phenylalanine	Benzyl	Cc1ccccc1	Neutral	165.19
W	Trp	Tryptophan	Indolylmethyl	Cc1c[nH]c2ccccc12	Neutral	204.23
M	Met	Methionine	Thioether	CCSC	Neutral	149.20

aa_nonpolar |>
  dplyr::select(aa,aa3,md_note,main_function) |>
  kbl()

aa	aa3	md_note	main_function
G	Gly	Minimal VDW radius; unrestricted phi/psi; near-zero excluded volume	Conformational flexibility; tight turns; active site geometry
A	Ala	Low steric perturbation; high alpha-helix propensity in force fields	Helix former; hydrophobic core; alanine-scanning mutagenesis
V	Val	Beta-branching restricts psi; favors extended beta-sheet; large gamma-carbons	Beta-sheet core; hydrophobic packing; sickle-cell HbS Glu6Val
L	Leu	Flexible chi2; common rotamers at -65/-65 and -65/175; high hydrophobic SASA	Hydrophobic core; leucine zippers; most abundant non-polar in proteomes
I	Ile	Beta-branching + gamma-branch; most restricted chi1/chi2; large buried SASA	Hydrophobic core; beta-barrel interiors; transmembrane helices
P	Pro	Fixed phi ~-60; no backbone NH donor; cis/trans isomerism at Xaa-Pro bond	Helix breaker; beta-turns; collagen Gly-Pro-X repeats
F	Phe	Rigid aromatic ring; pi-pi stacking and cation-pi in MD energy decomposition	Hydrophobic core; aromatic clusters; ligand binding pockets
W	Trp	Indole NH can H-bond; amphipathic at membrane interface; strong 280nm absorbance	Membrane anchoring; fluorescence probe; ligand binding; rarest standard AA
M	Met	Flexible sulfur geometry; oxidizable to sulfoxide in long MD runs; check reactive FF	Translation initiation; hydrophobic core; redox sensing

Claude generated most of the above information. We’ll add onto the md_note section as we encounter certain things during our MD sims.

What’s a Zwitterion?

A zwitterion is a molecule that has both positive and negative charges but is overall electrically neutral. In amino acids, the amino group (–NH₂) can accept a proton to become positively charged (–NH₃⁺), while the carboxyl group (–COOH) can lose a proton to become negatively charged (–COO⁻). At physiological pH (~7.4), most amino acids exist as zwitterions, with the amino group protonated and the carboxyl group deprotonated. This dual charge allows amino acids to interact with both polar and non-polar environments, contributing to their solubility in water and their ability to form various interactions in proteins.
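To put rough numbers on the zwitterion claim, here is a minimal sketch using the Henderson-Hasselbalch relation. The pKa values (about 2.3 for the carboxyl group and about 9.6 for the amino group, roughly those of glycine) are approximate textbook values that I'm assuming here; they are not taken from the tables above. frac_deprotonated is just a helper name made up for this example.

```r
# Fraction of an acidic group that has lost its proton at a given pH
# (Henderson-Hasselbalch): 1 / (1 + 10^(pKa - pH))
frac_deprotonated <- function(pH, pKa) 1 / (1 + 10^(pKa - pH))

pH <- 7.4
pKa_carboxyl <- 2.3   # assumed, approximate value for glycine's -COOH
pKa_amino    <- 9.6   # assumed, approximate value for glycine's -NH3+

carboxyl_minus <- frac_deprotonated(pH, pKa_carboxyl)   # fraction present as -COO-
amino_plus     <- 1 - frac_deprotonated(pH, pKa_amino)  # fraction present as -NH3+

cat("fraction -COO- :", carboxyl_minus, "\n")
cat("fraction -NH3+ :", amino_plus, "\n")
```

Under these assumed pKa values, both fractions come out very close to 1 at pH 7.4, which is what "exists as a zwitterion" means in practice.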

What Does Non-polar Actually Mean?

It is worth clarifying that “non-polar” refers exclusively to the side chain (R group): it consists largely of carbon-carbon and carbon-hydrogen bonds, with no significant dipole and no ionizable groups, which makes it hydrophobic and largely indifferent to water. It says nothing about the backbone, which is the same for all amino acids and always carries polar bonds (C=O, N–H). In fact, as mentioned above, all amino acids, including non-polar ones, exist as zwitterions at physiological pH, a property that comes entirely from the backbone amino and carboxyl groups, not the side chain.

Note to self: All amino acids’ backbones are zwitterionic; the R side chain determines polarity and hydrophobicity. Also, a neutral net charge only means the charges sum to zero; it does not mean the molecule is non-polar.

Rodrigues’ Rotation Formula

Rodrigues’ rotation formula is a method for rotating a 3D vector in space around a specified axis by a given angle. The formula is expressed as:

$v_{\mathrm{rot}} = v\cos\theta + (k \times v)\sin\theta + k\,(k \cdot v)\,(1 - \cos\theta)$

where v is the original vector, k is the unit vector along the axis of rotation, and θ is the angle of rotation in radians.

This formula is apparently very popular in computer graphics and robotics, but I can see how it can be useful in molecular dynamics as well when we want to rotate a molecule, or part of one, around an axis, especially when we want to estimate the lowest-energy conformation of a molecule. The direct application of this formula to an amino acid sequence is rearranging the atoms based on phi and psi, the angles of rotation around the N-Cα and Cα-C bonds of the backbone, respectively. By applying Rodrigues’ rotation formula, we can calculate the new positions of the atoms after rotating them by the specified angles, allowing us to explore different conformations of the molecule. How I remember which angle is which: Nancy Phi (sounds like some detective show, and also N->C) and C C Psi (all with an S sound, also carbon to carbon). We’ll leave the hand calculation until next time, but let’s learn how to rotate a coordinate around an axis with Rodrigues!

Below I’ll write the code first, then explain. Please feel free to use your mouse to hover over the plotly object and check out the coordinates.

library(plotly)
library(pracma)

#### Let's start simple
x1 <- c(0,0,0)
x2 <- c(1,1,1)
x3 <- c(1,2,1)

rodrigues <- function(v, k, theta) {
  k <- k / sqrt(sum(k^2))
  cos(theta)*v + sin(theta)*pracma::cross(k, v) + (1 - cos(theta))*sum(k*v)*k
}

k <- x2 - x1
v <- x3 - x1

result <- rodrigues(v,k,pi/2) #notice this, pi/2 == 90 degrees

pts <- data.frame(
  x = c(x1[1],x2[1],x3[1]),
  y = c(x1[2],x2[2],x3[2]),
  z = c(x1[3],x2[3],x3[3]),
  label = c("x1","x2","x3")
)

pts_rs <- data.frame(
  x = c(x1[1],x2[1],result[1]),
  y = c(x1[2],x2[2],result[2]),
  z = c(x1[3],x2[3],result[3]),
  label = c("x1","x2","x3_new")
)

plot_ly() |>
  add_trace(data=pts, x=~x, y=~y, z=~z,
            type="scatter3d", mode="lines+markers+text",
            text=~label,
            marker=list(size=8, color="blue", opacity=0.5),
            line=list(width=4, color="blue", dash="solid")) |>
  add_trace(data=pts_rs, x=~x, y=~y, z=~z,
            type="scatter3d", mode="lines+markers+text",
            text=~label,
            marker=list(size=8, color="red", opacity=0.5),
            line=list(width=4, color="red", dash="dash"))

So with the above, we start off with 3 points, x1, x2, and x3, each given by its xyz coordinates.

Funny thing is, xyz coordinate here is different from what I learnt xyz as. I’ve always thought x is horizontal, y is vertical and z is depth. But in this case, x is depth, y is horizontal and z is vertical. I guess it depends on how you look at it.

Then we want to rotate x3 around the axis defined by x1 and x2 by 90 degrees (pi/2 radians). The rodrigues function takes in the vector v (which is the vector from x1 to x3), the axis k (which is the vector from x1 to x2), and the angle theta (which is pi/2). It returns the new coordinates of x3 after rotation.
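The example above is convenient because x1 sits at the origin, so the vectors and the point coordinates coincide. A sketch of the more general case, where the axis passes through two arbitrary points a and b, is to shift so that a becomes the origin, rotate, then shift back. rotate_about_axis and cross3 are hypothetical helper names for this illustration, and I've written the cross product out by hand so the block runs with base R only.

```r
# Cross product of two 3D vectors, written out so the sketch needs base R only
cross3 <- function(a, b) {
  c(a[2] * b[3] - a[3] * b[2],
    a[3] * b[1] - a[1] * b[3],
    a[1] * b[2] - a[2] * b[1])
}

# Same formula as the rodrigues() function above
rodrigues <- function(v, k, theta) {
  k <- k / sqrt(sum(k^2))
  cos(theta) * v + sin(theta) * cross3(k, v) + (1 - cos(theta)) * sum(k * v) * k
}

# Rotate point p about the axis running from a to b (hypothetical helper):
# move a to the origin, rotate, then move back
rotate_about_axis <- function(p, a, b, theta) {
  a + rodrigues(p - a, b - a, theta)
}

a <- c(1, 1, 0)
b <- c(1, 1, 1)   # axis is parallel to z and passes through x = 1, y = 1
p <- c(2, 1, 0)

p_new <- rotate_about_axis(p, a, b, pi / 2)
print(round(p_new, 6))
```

The distance from the point to the axis should be unchanged by the rotation, which is a handy sanity check when the same idea is applied to backbone atoms.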

Finally, we plot the original points and the rotated point using plotly. The original points are in blue, and the rotated point is in red. You can hover over the points to see their coordinates.

Now if we maneuver the 3D plot and align x1 and x2 into a single dot, looking down the axis from x2 towards x1, we can clearly see that x3 moved 90 degrees anti-clockwise!

There is a pretty cool rule for telling whether the rotation goes anti-clockwise or clockwise: the right-hand rule! Point your right thumb along the axis (here from x1 to x2) and your fingers curl in the direction of positive rotation. Remember this from high school?
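The hand rule (usually called the right-hand rule) can be checked numerically on the simplest possible case. This is a small sketch using base R only; cross3 is a hypothetical helper name, and the axis and vector are chosen purely for illustration.

```r
cross3 <- function(a, b) {
  c(a[2] * b[3] - a[3] * b[2],
    a[3] * b[1] - a[1] * b[3],
    a[1] * b[2] - a[2] * b[1])
}

rodrigues <- function(v, k, theta) {
  k <- k / sqrt(sum(k^2))
  cos(theta) * v + sin(theta) * cross3(k, v) + (1 - cos(theta)) * sum(k * v) * k
}

z_axis <- c(0, 0, 1)
x_hat  <- c(1, 0, 0)

# Thumb along +z: fingers curl from +x towards +y
ccw <- rodrigues(x_hat, z_axis, pi / 2)

# Thumb along -z: same angle, but the turn now goes from +x towards -y
cw <- rodrigues(x_hat, -z_axis, pi / 2)

print(round(ccw, 6))
print(round(cw, 6))
```

Flipping the axis direction has the same effect as negating the angle, so it matters which way k points when we later rotate around a bond.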

It would be really cool to derive the above formula. There are a lot of videos that have done this. I’m still trying to conceptualize it, let’s leave that for another blog! It sounds interesting and may be a good exercise, especially when we’re venturing into 3d spaces.

Lennard-Jones Potential Energy

The Lennard-Jones potential energy formula is a mathematical model used to describe the interaction between a pair of neutral atoms or molecules. It is given by the equation:

$V(r) = 4\epsilon \left[ \left( \frac{\sigma}{r} \right)^{12} - \left( \frac{\sigma}{r} \right)^6 \right]$

Where:

$V(r)$ is the potential energy as a function of the distance $r$ between the two particles.
$\epsilon$ is the depth of the potential well, representing the strength of the attractive interaction.
$\sigma$ is the finite distance at which the inter-particle potential is zero, representing the effective diameter of the particles.
The term $\left( \frac{\sigma}{r} \right)^{12}$ represents the repulsive part of the potential, which dominates at short distances due to the Pauli exclusion principle.
The term $\left( \frac{\sigma}{r} \right)^6$ represents the attractive part of the potential, which dominates at longer distances due to van der Waals forces.

The Lennard-Jones potential is widely used in molecular dynamics simulations to model the interactions between non-bonded atoms or molecules, particularly in the context of van der Waals forces. It helps to predict the behavior of particles in a system, such as their equilibrium positions and the energy landscape of molecular interactions.
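Two landmarks of the curve follow directly from the formula: the potential is zero at r = σ, and its minimum sits at r = 2^(1/6) σ with depth −ε. The sketch below checks both numerically; the eps and sig values are arbitrary illustrative numbers, not parameters of any particular atom.

```r
lj <- function(r, eps, sig) 4 * eps * ((sig / r)^12 - (sig / r)^6)

eps <- 0.1   # well depth, arbitrary illustrative value (kcal/mol)
sig <- 3.4   # zero-crossing distance, arbitrary illustrative value (Angstrom)

v_at_sigma     <- lj(sig, eps, sig)   # repulsive and attractive terms cancel at r = sigma
r_min_analytic <- 2^(1/6) * sig       # distance where dV/dr = 0

# Numerical minimum over a bracket that contains the well
opt <- optimize(lj, interval = c(0.9 * sig, 3 * sig), eps = eps, sig = sig)

cat("V(sigma)        =", v_at_sigma, "\n")
cat("r_min analytic  =", r_min_analytic, "\n")
cat("r_min numerical =", opt$minimum, "\n")
cat("V(r_min)        =", opt$objective, "(should be close to -eps)\n")
```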

Wow, there are a bunch of terms and words above! I’m getting dizzy just keeping track of what is what. Let’s push through this. To use the above formula, we’d have to have some understanding of the parameters epsilon and sigma. These parameters are typically derived from experimental data or quantum mechanical calculations and are specific to the types of atoms or molecules involved in the interaction. Where to get these parameters? Here you go: openbabel::gaff.dat

When you open gaff.dat, there are a bunch of numbers! Let’s find the ones that are meaningful for us. The sections below appear in this order, separated by blank lines, as you scroll down.

Atom Types

The column names should be: type mass(g/mol) polarizability(Å³) source

Bond Stretch

The column names should be: types K r0 source count rmsd

Proper Dihedral

The column names should be: types div barrier phase periodicity

Non-bonded

The column names should be: type R*(Å) ε(kcal/mol)

We are purely interested in the non-bonded section, where ε is our epsilon and R* converts to our sigma via sigma = 2 × R* / 2^(1/6). With these parameters, we can then calculate the Lennard-Jones potential energy between any two atoms in a molecule. Let’s do a simple calculation for ethanol.
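Where does the factor 2 / 2^(1/6) come from? As far as I understand the AMBER convention, R* is a van der Waals radius, that is, half the separation at which two identical atoms reach the LJ minimum. Since the minimum of the LJ curve is at 2^(1/6) × sigma, converting back should return 2 × R*. A quick check using the carbon R* value from the calculation in this post:

```r
Rstar_to_sigma <- function(Rstar) 2 * Rstar / 2^(1/6)

Rstar_c <- 1.9080                    # R* used for the carbons in this post
sigma_c <- Rstar_to_sigma(Rstar_c)

# For two identical atoms the LJ minimum sits at 2^(1/6) * sigma,
# which should give back 2 * R* (the sum of the two radii)
r_min <- 2^(1/6) * sigma_c

cat("sigma =", sigma_c, "Angstrom\n")
cat("r_min =", r_min, "Angstrom; 2 * R* =", 2 * Rstar_c, "\n")
```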

Calculating LJ

library(tidyverse)
library(igraph)
# Coordinates (x, y, z) in Angstroms, according to pubchem ethanol molecule
coords <- rbind(
  O   = c( -1.1712,   0.2997,   0.0000),
  C2  = c( -0.0463,  -0.5665,   0.0000),
  C1  = c(  1.2175,   0.2668,   0.0000),
  H4  = c( -0.0958,  -1.2120,   0.8819),
  H5  = c( -0.0952,  -1.1938,  -0.8946),
  H1  = c(  2.1050,  -0.3720,  -0.0177),
  H2  = c(  1.2426,   0.9307,  -0.8704),
  H3  = c(  1.2616,   0.9052,   0.8886),
  H6  = c( -1.1291,   0.8364,   0.8099)
)

# AMBER GAFF parameters (sigma Å, epsilon kcal/mol)
Rstar_to_sigma <- function(Rstar) 2 * Rstar / 2^(1/6)

sigma <- c(
  C1=Rstar_to_sigma(1.9080), C2=Rstar_to_sigma(1.9080),
  O=Rstar_to_sigma(1.7210),
  H1=Rstar_to_sigma(1.4870), H2=Rstar_to_sigma(1.4870),
  H3=Rstar_to_sigma(1.4870), H4=Rstar_to_sigma(1.4870),
  H5=Rstar_to_sigma(1.4870), H6=0.0000
)

epsilon <- c(
  C1=0.1094, C2=0.1094, O=0.2104,
  H1=0.0157, H2=0.0157, H3=0.0157,
  H4=0.0157, H5=0.0157, H6=0.0000
)

# Bonds 
bonds <- tribble(
  ~from, ~to,
  "C1", "C2",
  "C2", "O",
  "O", "H6",
  "C1", "H1",
  "C1", "H2",
  "C1", "H3",
  "C2", "H4",
  "C2", "H5"
)

# count bonds between two atoms
g <- graph_from_data_frame(bonds, directed = FALSE)
g_dist <- distances(g)

# LJ function
lj <- function(r, eps, sig) 4 * eps * ((sig/r)^12 - (sig/r)^6)

# Loop all pairs
atoms <- rownames(coords)
pairs <- combn(atoms, 2, simplify=FALSE) # combn so we don't repeat
total_V <- vector(mode = "numeric", length = length(pairs)) 
 
for (i in seq_along(pairs)) {
  
  # each pair
  p <- pairs[[i]]
  from <- p[1]
  to <- p[2]
  num_bond <- g_dist[from, to]
  
  if (num_bond <= 2) next  # skip directly bonded (1-2) and angle (1-3) pairs
  
  # params needed for LJ
  r   <- sqrt(sum((coords[from,] - coords[to,])^2))
  sig <- (sigma[from] + sigma[to]) / 2
  eps <- sqrt(epsilon[from] * epsilon[to])
  
  # scale if num bond is 3 (4 atoms)
  scale <- if (num_bond == 3) 0.5 else 1.0
  
  # LJ calc
  V <- scale * lj(r, eps, sig)

  cat(from, "-", to, " num of bonds=", num_bond, " r=", r, " V=", V, "\n")
  total_V[i] <- V
}

## O - H1  num of bonds= 3  r= 3.344395  V= -0.02733269 
## O - H2  num of bonds= 3  r= 2.642383  V= 0.110613 
## O - H3  num of bonds= 3  r= 2.659841  V= 0.09535471 
## C1 - H6  num of bonds= 3  r= 2.546942  V= 0 
## H4 - H1  num of bonds= 3  r= 2.521587  V= 0.01461147 
## H4 - H2  num of bonds= 3  r= 3.074579  V= -0.007593085 
## H4 - H3  num of bonds= 3  r= 2.514978  V= 0.01576022 
## H4 - H6  num of bonds= 3  r= 2.295394  V= 0 
## H5 - H1  num of bonds= 3  r= 2.507028  V= 0.01720963 
## H5 - H2  num of bonds= 3  r= 2.510736  V= 0.01652426 
## H5 - H3  num of bonds= 3  r= 3.070262  V= -0.007612401 
## H5 - H6  num of bonds= 3  r= 2.845344  V= 0 
## H1 - H6  num of bonds= 4  r= 3.550289  V= 0 
## H2 - H6  num of bonds= 4  r= 2.908137  V= 0 
## H3 - H6  num of bonds= 4  r= 2.392984  V= 0

cat("\nTotal V_LJ:", sum(total_V), "kcal/mol\n")

## 
## Total V_LJ: 0.2275351 kcal/mol

Alright, in the above we calculated the LJ energy for all atom pairs that are 3 or more bonds apart (pairs exactly 3 bonds apart are scaled by half), skipping pairs separated by only 1 or 2 bonds. Now that we know how to do that on a simple molecule, next time we can use this as the energy to minimize when we’re seeking the optimal phi and psi!

Opportunities For Improvement

Derive Rodrigues’ rotation formula
put both Rodrigues’ rotation formula and the LJ calculation into action to find the optimal phi and psi of the amino acid sequence of a protein!
need to include secondary/tertiary structure interactions too

Lessons learnt

refreshed on rotation and vectors
learnt Rodrigues’ rotation formula
learnt LJ formula
learnt what gaff.dat actually contains.
learnt about phi and psi and what they actually mean.
learnt that net charge means total charge, and that polar vs non-polar depends on the R side chain.

If you like this article:

please feel free to send me a comment or visit my other blogs
please feel free to follow me on BlueSky, twitter, GitHub or Mastodon
if you would like to collaborate please feel free to contact me

To leave a comment for the author, please follow the link and comment on their blog: r on Everyday Is A School Day.

Continue reading: Learning Amino Acids Part 1: Non-Polar Amino Acids, Rodrigues Rotation, and Lennard-Jones Potential

Five recent R-universe features you might have missed

rOpenSci — Sun, 07 Jun 2026 00:00:00 +0000

[This article was first published on rOpenSci - open tools for open science, and kindly contributed to R-bloggers].

One of the challenges of working on R-universe is that there is never really a release day.

Unlike software projects that accumulate changes for a big launch, R-universe evolves continuously. New features, infrastructure improvements, UI tweaks, and build system enhancements are silently deployed all the time without most people noticing.

Every now and then, however, a few features emerge that are worth highlighting. In this technote we look at five recent additions that make R-universe a little nicer, faster, or more convenient to use.

1. Social media cards that actually look good

Sharing package links on social media used to be a somewhat underwhelming experience, but not anymore! We now provide beautiful preview images for every package, article, and universe.

Each card includes package or universe stats and is automatically exposed through the appropriate HTML headers (og:image, og:title, etc). Whenever somebody shares a package link, the preview should look a bit more polished without requiring any work from package maintainers.

R-universe now generates social media preview cards for each package, like this one: ropensci.r-universe.dev/targets. You can also get the card manually from the /{package}/card.png API (or svg).

— Jeroen Ooms (@jeroenooms.bsky.social) 12:01 · May 2, 2026
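As a small illustration of the URL pattern mentioned in the post above, the sketch below builds the card address for a package. card_url() is a hypothetical helper written for this example, not part of any R-universe client; the path simply follows the /{package}/card.png (or svg) pattern and the ropensci.r-universe.dev/targets example quoted above.

```r
# Build the preview-card URL for a package in a given universe.
# card_url() is a hypothetical helper; the path follows the
# /{package}/card.png pattern quoted above.
card_url <- function(universe, package, ext = c("png", "svg")) {
  ext <- match.arg(ext)
  sprintf("https://%s.r-universe.dev/%s/card.%s", universe, package, ext)
}

url <- card_url("ropensci", "targets")
print(url)

# Uncomment to fetch the image (needs network access):
# download.file(url, destfile = "targets-card.png", mode = "wb")
```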

When a link to a vignette article is shared, R-universe automatically extracts the title and section headings from the document to generate a more informative description. For example this one.

All this won’t guarantee your package goes viral, but at least it looks cool.

2. PACKAGES.rds support (or: implementing R internals in JavaScript)

This feature is mostly invisible, but improves performance of installing packages in R, and therefore also the workflow of building packages in R-universe:

Every CRAN-like repository needs an index file which lists all the content from that repo. This file may be provided in a text-based PACKAGES format and/or a binary PACKAGES.rds format (rds is R’s internal binary serialization format, see ?saveRDS).

Historically R-universe supported only the former text-based format, because all repository metadata is generated on-request in JavaScript on the server side, and emitting DCF text streams from our database is fast and easy. However, on the R side, loading RDS is a bit faster than parsing text, which becomes noticeable for large repositories like https://bioc.r-universe.dev/.

We therefore now also serve the PACKAGES.rds files. The implementation lives in this NPM package, which reverse engineers a subset of the R RDS serializer so that we can easily run it in our express stack. On macOS and Windows it defaults to the new zstd compression, which makes it even faster than CRAN.
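To make the difference between the two index formats concrete, here is a minimal local sketch that writes the same tiny index once as DCF text and once as RDS, then reads both back. The package names and versions are made up, and a real PACKAGES index carries many more fields; this only illustrates the two serializations, not how R-universe generates them.

```r
# A tiny made-up repository index with two of the fields a real index has
idx <- cbind(Package = c("pkgA", "pkgB"), Version = c("1.0.0", "0.2.1"))

txt_file <- tempfile("PACKAGES")
rds_file <- tempfile("PACKAGES", fileext = ".rds")

write.dcf(idx, file = txt_file)   # text-based DCF format
saveRDS(idx, file = rds_file)     # binary serialization, see ?saveRDS

from_txt <- read.dcf(txt_file)    # has to parse "Field: value" records
from_rds <- readRDS(rds_file)     # restores the matrix directly

print(from_txt)
print(all(from_txt == from_rds))
```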

3. Fancy sort/filter bars in the WebUI

The styling of universe-level pages that list packages, articles, and datasets has been improved, gaining some nice interactive filter and sort capabilities. For example, the /packages page now allows you to do a (fuzzy) search for keywords that appear in package descriptions/tags/authors/etc, and sort the packages by their number of stars / downloads / dependents / etc.

A similar filter bar is available on the /articles and /datasets pages to help you search those as well.

4. For the impatient: trigger a sync manually

R-universe automatically checks for updates in upstream git repositories and package registries approximately once per hour. Occasionally, however, you have just pushed a commit, fixed a build issue, updated a vignette, or corrected a typo, and waiting an hour suddenly feels like a very long time.

To accommodate the impatient among us, a new sync button has been added to the universe sidebar. Clicking the button immediately triggers a sync to check for any updates.

5. Making check results easier to find and share

For some organizations, package checks are among the most important parts of R-universe. We’ve made several improvements to make check results easier to access and easier to share with collaborators.

First, package pages now support direct links to the check table using the #checktable anchor, for example: https://jeroen.r-universe.dev/curl#checktable. Second, build logs and build artifacts linked in this table can now be downloaded without requiring GitHub authentication. The underlying files still live on GitHub Actions, but R-universe now proxies the download links. This means users can access logs and build artifacts directly from the package page, even if they do not have a GitHub account. So “I don’t have GitHub” is no longer available as an excuse for ignoring check failures.

Finally, build runs now include a deployment summary generated via the GitHub Actions Job Summaries feature: navigate to any build run and scroll down to find a summary table showing exactly the data deployed to R-universe during that run, including package checks and deployment results. This makes it easier to inspect builds directly from GitHub without having to cross-reference multiple pages.

To leave a comment for the author, please follow the link and comment on their blog: rOpenSci - open tools for open science.

Welcome Joe Zhu

Ross Farrugia — Thu, 04 Jun 2026 00:00:00 +0000

[This article was first published on pharmaverse blog, and kindly contributed to R-bloggers].

Hi pharmaverse community,

I wanted to share that Joe Zhu will be taking over from me (Ross Farrugia) as the Roche/Genentech representative on our pharmaverse council.

Having been one of the initial co-founders of pharmaverse back in mid-2020, it feels like the right time after 6 years to bring fresh perspectives and energy, and I couldn’t think of anyone better than Joe for this. I’m incredibly proud (as I hope all of our community are) of how far we’ve come doing our bit to bring countless companies and individuals together to make open source collaborations a reality across pharma.

In our early years, I had the privilege to present in front of a distinguished board and someone there challenged me with “good luck in getting pharmaceutical companies to share code”. In reply I shared the story of admiral and pharmaverse, and they were blown away. In all their years of experience they had never imagined something like this would ever succeed at scale. It is a testament to every single one of you in our community (and thanks to supporters like PHUSE) that it has and it continues to grow!

I won’t be going far away, as now I get to continue as an individual contributor to pharmaverse.

Here are some more details on Joe and where you can learn more about our council:

Bio:

Joe Zhu

Joe Zhu is a Principal Data Scientist at Roche and the Chief Product Owner and Lead Engineer for the company’s NEST project. A prominent advocate for open-source software in the global pharmaceutical sector, Joe co-founded both the China Pharma R User Group (Pharmarug) and R in Pharma APAC. He is dedicated to bridging the gap between advanced statistics, genomics research, and clinical trial software.

To learn more about our council please check our site page, and you can reach this group anytime using: pharmaverse.council@phuse.global.

Last updated

2026-06-04 07:19:08.001299

Reuse

CC BY 4.0

Citation

BibTeX citation:

@online{farrugia2026,
  author = {Farrugia, Ross},
  title = {Welcome {Joe} {Zhu}},
  date = {2026-06-04},
  url = {https://pharmaverse.github.io/blog/posts/2026-06-05-welcome-joe-zhu/welcome-joe-zhu.html},
  langid = {en}
}

For attribution, please cite this work as:

Farrugia, Ross. 2026. “Welcome Joe Zhu.” June 4, 2026. https://pharmaverse.github.io/blog/posts/2026-06-05-welcome-joe-zhu/welcome-joe-zhu.html.

To leave a comment for the author, please follow the link and comment on their blog: pharmaverse blog.

Football meets machine learning: Forecasting the 2026 FIFA World Cup

Achim Zeileis — Tue, 02 Jun 2026 22:00:00 +0000

[This article was first published on Achim Zeileis, and kindly contributed to R-bloggers].

Probabilistic forecasts for the 2026 FIFA World Cup are obtained by using a hybrid model that combines data, expert insights, and advanced statistical models. The favorite is Spain, closely followed by England, France, and Germany.

Football fans around the world are looking forward to the kick-off of the 2026 FIFA World Cup in Canada, Mexico, and the United States next week. 48 of the best teams from all around the world will compete from 11 June to 19 July to determine the new World Champion. In anticipation of the tournament the big question is who among the teams will succeed, who will drop out, and who will eventually prevail. While it is, of course, not yet possible to give definitive answers to these questions, we are able to provide probabilistic forecasts for all possible matches using a refined machine learning algorithm. This allows us to explore the likely course of the tournament by simulation.

Winning probabilities

The forecast is based on a machine learning algorithm that blends a variety of different sources of information: An ability estimate for every team based on historic matches; an ability estimate for every team based on odds from 24 bookmakers; average ratings of the players in each team based on their individual performances in their home clubs and national teams; the average market value of all players in each team according to a wisdom-of-the-crowd approach; further team and country covariates (e.g., FIFA and Elo ratings or GDP).

A machine learning algorithm is trained on the results of all major football tournaments (Men’s World Cups and Euros) between 2006 and 2024 and then applied to current information to obtain a forecast for the 2026 FIFA World Cup. More specifically, the algorithm estimates the predicted number of goals for all possible matches between all 48 teams in the tournament. Based on the predicted goals the probabilities for each potential outcome (i.e., 0-0, 1-0, 0-1, 2-0, etc.) in each of these matches can be computed from a bivariate Poisson distribution (here: assuming independence). This allows us to simulate all matches in the group phase and which teams proceed to the knockout stage and who eventually wins. Repeating the simulation 100,000 times yields winning probabilities for each team.

The results show that Spain is the favorite for the title with a winning probability of 14.5%, closely followed by England and France, both with 12.4%, and Germany with 11.2%. The winning probabilities for all teams are shown in the barchart below with more information linked in the interactive full-width version.

Interactive full-width graphic

The study has been conducted by an international team of researchers: Andreas Groll, Agamyrat Hanekov, Lars Magnus Hvattum, Rouven Michels, Gunther Schauberger, Elina Sukhanova, Sebastian Witte, Achim Zeileis. The basic idea for the forecast is to proceed in two steps. In the first step, sophisticated statistical models as well as expert insights are employed to determine the strengths of all teams and their players using disparate sets of information. In the second step, a machine learning algorithm decides how to best combine the strength estimates with other information about the teams.

Historic information: Match abilities.
An ability estimate is obtained for every team based on “retrospective” data, namely all historic national matches over the last 8 years (freely curated by Mart Jürisoo on Kaggle). A bivariate Poisson model with team-specific fixed effects and assuming independence is fitted to the number of goals scored by both teams in each match. However, rather than equally weighting all matches to obtain average team abilities (or team strengths) over the entire history period, an exponential weighting scheme is employed. This assigns more weight to more recent results and thus yields an estimate of current team abilities. More details can be found in Ley, Van de Wiele, Van Eetvelde (2019).
Future expectation: Bookmaker consensus abilities.
Another ability estimate for every team is obtained based on “prospective” data, namely the odds of 24 international bookmakers that reflect their expert expectations for the tournament. Using the bookmaker consensus model of Leitner, Zeileis, Hornik (2010), the bookmaker odds are first adjusted for the bookmakers’ profit margins (“overround”) and then averaged (on a logit scale) to obtain a consensus for the winning probability of each team. To adjust for the effects of the tournament draw (that might have led to easier or harder groups for some teams), an “inverse” simulation approach is used to infer which team abilities are most likely to lead up to the consensus winning probabilities.
Individual player contributions: Average player ratings.
To infer the “contributions of individual players” in a match, the plus-minus player ratings of Pantuso & Hvattum (2021) dissect all matches with a certain player (both on club and on national level) into segments, e.g., between substitutions. Subsequently, the goal difference achieved in these segments is linked to the presence of the individual players during that segment. This yields individual ratings for all players that can be aggregated to average player ratings for each team.
Wisdom of the crowd: Average market values.
Another way to reflect the current quality and the future potential of each player in a team is to consider their expected market value. As the real market values are unknown, the Transfermarkt web portal employs a “wisdom-of-the-crowd” approach to determine current expected market values for all players. These are based on discussions relying on publicly available data among the online community members of the portal and moderated and consolidated by expert community members and the portal’s employees.
Combination with present status: Hybrid random forests.
Finally, machine learning is used to combine these four highly aggregated and informative variables with a broad range of further relevant covariates reflecting the current states of the different teams and the countries they come from. Such a hybrid approach was first suggested by Groll, Ley, Schauberger, Van Eetvelde (2019). A random forest algorithm is trained to decide how to blend the different ability estimates with team-specific features that are typically less informative but still powerful enough to enhance the forecasts. The features considered comprise team-specific details (e.g., FIFA rank, Elo rating, number of Champions League players) as well as country-specific socio-economic factors (such as GDP per capita). By combining a large ensemble of rather weakly informative regression trees in a random forest, the relative importances of all the covariates can be inferred automatically. The resulting predicted number of goals for each team can then finally be used to simulate the entire tournament 100,000 times.
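To illustrate the bookmaker consensus step described in the list above, here is a deliberately simplified sketch in R. The odds are made up for a three-team toy tournament, and the margin removal shown here (rescaling each bookmaker's implied probabilities to sum to one) is a simplifying assumption for illustration; it is not necessarily the adjustment used in the bookmaker consensus model of Leitner, Zeileis, Hornik (2010). The helper names logit and inv_logit are ours.

```r
# Made-up decimal odds for a 3-team toy tournament; rows = bookmakers
odds <- rbind(
  bookie1 = c(A = 2.0, B = 3.0, C = 5.0),
  bookie2 = c(A = 2.2, B = 2.8, C = 4.5)
)

logit     <- function(p) log(p / (1 - p))
inv_logit <- function(x) 1 / (1 + exp(-x))

raw       <- 1 / odds            # implied probabilities; each row sums to more than 1
overround <- rowSums(raw) - 1    # each bookmaker's margin
adj       <- raw / rowSums(raw)  # simplified margin removal: rescale rows to sum to 1

# Average across bookmakers on the logit scale, then transform back
consensus <- inv_logit(colMeans(logit(adj)))

print(round(overround, 3))
print(round(consensus, 3))
```

Because the averaging happens on the logit scale, the consensus probabilities need not sum to exactly one; with odds this similar the deviation is tiny.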

Match probabilities

Using the forecasts from the machine learning algorithm yields the predicted number of goals for both teams in each possible match. The explanatory information used for this is the difference between the two teams in each of the variables listed above, i.e., the difference in historic match abilities (on a log scale), the difference in bookmaker consensus abilities (on a log scale), difference in average player ratings of the teams, difference in log market values, etc. The predicted number of goals for the two teams in each match can then be plugged as expectations into two independent Poisson distributions, from which we can compute the probability that a certain match ends in a win, a draw, or a loss. The same can be repeated in overtime, if necessary, and a coin flip is used to decide penalties, if needed.
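The step from predicted goals to win/draw/loss probabilities can be sketched in a few lines of R. The expected goals below are hypothetical numbers chosen for illustration, not values from our forecast, and the score grid is truncated at 15 goals per team, where the remaining probability mass is negligible for expectations of this size.

```r
# Hypothetical expected goals for the two teams (not taken from the forecast)
lambda_a <- 1.6
lambda_b <- 1.1

goals <- 0:15   # truncated score grid

# Joint probability of each score line under independence:
# rows = goals of team A, columns = goals of team B
score_prob <- outer(dpois(goals, lambda_a), dpois(goals, lambda_b))

p_win  <- sum(score_prob[lower.tri(score_prob)])  # team A scores more than team B
p_draw <- sum(diag(score_prob))                   # equal number of goals
p_loss <- sum(score_prob[upper.tri(score_prob)])  # team A scores fewer than team B

print(round(c(win = p_win, draw = p_draw, loss = p_loss), 3))
```

For a knockout match, the draw probability would then be split further by repeating the calculation for overtime and, if needed, a coin flip for penalties, as described above.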

The following heatmap shows for each possible combination of teams the probability that one team beats the other team in a knockout match. The color scheme uses green vs. purple to signal probabilities above vs. below 50%, respectively. The tooltips for each match in the interactive version of the graphic also print the probabilities for the match to end in a win, draw, or loss after normal time.

Interactive full-width graphic

Performance throughout the tournament

As the goals for both teams in every single match can be simulated with the approach described above, it is also straightforward to simulate the entire tournament (here: 100,000 times), providing “survival” probabilities for each team across the different stages.
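A single knockout match from such a simulation could be sketched as follows. The expected goals, the one-third scaling for extra time, and the number of replications are assumptions made for illustration. The full tournament simulation would chain many such matches through the bracket.

```r
# Simulate one knockout match: normal time, then extra time, then a coin flip.
# The expected goals and the 1/3 scaling for extra time are illustrative assumptions.
simulate_knockout <- function(lambda_a, lambda_b, extra_time_factor = 1 / 3) {
  goals_a <- rpois(1, lambda_a)
  goals_b <- rpois(1, lambda_b)
  if (goals_a == goals_b) {
    # Extra time lasts 30 of 90 minutes, so expected goals are scaled down
    goals_a <- goals_a + rpois(1, lambda_a * extra_time_factor)
    goals_b <- goals_b + rpois(1, lambda_b * extra_time_factor)
  }
  if (goals_a == goals_b) {
    # Penalty shoot-out decided by a fair coin flip
    return(sample(c("A", "B"), 1))
  }
  if (goals_a > goals_b) "A" else "B"
}

set.seed(2026)
n_sims <- 10000
winners <- replicate(n_sims, simulate_knockout(1.6, 1.1))
advance_prob_a <- mean(winners == "A")  # share of simulations in which team A advances
```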

Interactive full-width graphic

Odds and ends

All our forecasts are probabilistic, clearly below 100%, and hence by no means certain. Although we can quantify this uncertainty in terms of probabilities from a multiverse of potential tournaments, it is far from being predetermined which of these potential tournaments we will eventually see during the actual tournament.

Nevertheless the probabilistic view provides us with some interesting insights: For example, compared to predictions for previous tournaments (see e.g., 2018, 2022), it is even more uncertain who will win the title as there are a number of teams with good (albeit none with very high) chances of winning the tournament. An important factor for this is the substantially increased size of the tournament with 48 teams (rather than the previous 32) and an additional knockout round. Also, the tournament draw is much more variable, because 8 of the 12 third-ranked teams proceed to the knockout stage with 495 (!) possible permutations for mapping groups to matches in the round of 32.
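The figure of 495 is the number of ways to choose which 8 of the 12 groups send their third-ranked team to the knockout stage, which can be checked directly:

```r
# Number of ways to pick which 8 of the 12 groups send their third-ranked team through
n_combinations <- choose(12, 8)
n_combinations
# 495
```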

Moreover, comparing our forecasts to those based only on the bookmakers' odds, it is striking that Germany is ranked 4th, closely behind the three top teams, while it is only ranked 7th by many bookmakers. Conversely, Brazil and Argentina are typically ranked higher by the bookmakers but perform worse in our machine-learning-calibrated simulation.

In any case, all of this means that the probabilistic forecasts leave a lot of room for surprises and excitement during the 2026 FIFA World Cup. But what is absolutely certain is that we look forward to an entertaining tournament as football fans (much more than as professional forecasters).

To leave a comment for the author, please follow the link and comment on their blog: Achim Zeileis.

Continue reading: Football meets machine learning: Forecasting the 2026 FIFA World Cup

A Multi-Agent DDQN Strategic Audit Engine for Silver Markets using Keras/Tensorflow

Selcuk Disci — Tue, 02 Jun 2026 12:46:44 +0000

[This article was first published on DataGeeek, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

1. Introduction & Theoretical Framework

In modern electronic trading markets, algorithmic execution engines drive the vast majority of institutional order flows. Evaluating whether these independent, learning-driven trading algorithms behave competitively or tacitly coordinate has become a critical challenge for quantitative compliance, market microstructure design, and risk management.

This technical article implements an automated Strategic Audit Engine designed to evaluate algorithmic execution regimes in the Silver futures market (SI=F). Our framework is explicitly built upon the empirical and theoretical foundations laid out by Koulouris & Campajola (2026) in their groundbreaking paper, “Memory-Induced Supra-Competitive Outcomes Between Deep Reinforcement Learning Agents in Optimal Trade Execution” (arXiv:2605.20348v1, May 2026).

The Core Thesis: Supra-Competitive Outcomes via Memory Paths

Traditional regulatory frameworks look for explicit collusion (active communication or cartel setups). However, Koulouris & Campajola demonstrate a far more subtle phenomenon: when independent Deep Reinforcement Learning (DRL) agents are equipped with memory—meaning they learn from rolling windows of historical price trajectories—they naturally converge toward supra-competitive outcomes. These are states where joint rewards remain artificially high, or execution parameters naturally align to mimic cooperation, without any explicit information exchange.

To audit this behavior empirically, our engine models a symmetric duopoly market interaction. It maps the actual market execution path against two fundamental game-theoretic baselines:

The Cooperative Boundary (TWAP / Pareto Frontier): An idealized, optimal trade execution path where volume is distributed evenly across time to minimize joint market impact and maximize long-term mutual utility.
The Competitive Boundary (Nash Equilibrium): The aggressive, non-cooperative state where individual agents structurally undercut each other, driving execution shortfall parameters to their maximum baseline.
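To make the cooperative baseline concrete, a TWAP schedule simply splits a parent order into equal slices over the horizon. The sketch below uses a hypothetical order size and number of steps. It illustrates the volume definition given above, not the price-path proxy built later in the article.

```r
# TWAP: split a parent order evenly across the execution horizon.
# total_volume and n_steps are hypothetical inputs.
twap_schedule <- function(total_volume, n_steps) {
  rep(total_volume / n_steps, n_steps)
}

schedule <- twap_schedule(total_volume = 1000, n_steps = 10)
remaining <- 1000 - cumsum(schedule)  # inventory left after each slice
```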

2. Technical Stack & Environmental Setup

To build a production-grade, reproducible multi-agent simulation pipeline, we leverage a hybrid data-science and deep-learning toolkit within the R ecosystem:

tidyquant & tidyverse: Serve as our core data engineering layer, managing financial API queries, formatting continuous return matrices, and handling functional list columns.
keras & tensorflow: Form the algorithmic backbone, allowing us to build, train, and run simultaneous forward/backward passes on Deep Q-Networks.
ggtext & glue: Let the visualization layer render inline Markdown/HTML in plot text and handle dynamic string interpolation.

# 1. ENVIRONMENT SETUP
if (!require("pacman")) install.packages("pacman")
pacman::p_load(tidyquant, tidyverse, ggtext, glue, keras, tensorflow)

3. Building the Double Deep Q-Network Topology

Following the paper’s thesis on symmetric duopoly interactions, we construct two structurally identical execution agents: agent_A and agent_B. Both utilize a Dense Neural Network (Multilayer Perceptron) architecture to approximate the action-value space, denoted as Q(s, a).

The state space contains 3 features: Price Deviation, Asset Volatility (sigma), and Relative Time Horizon. The output layer projects to 3 discrete strategic action coordinates via a linear activation function.

# 2. SYMMETRIC AGENT ARCHITECTURE
build_strategic_agent <- function(state_size = 3, action_size = 3) {
  model <- keras_model_sequential() %>%
    layer_dense(units = 32, activation = "relu", input_shape = c(state_size)) %>%
    layer_dense(units = 32, activation = "relu") %>%
    layer_dense(units = action_size, activation = "linear")
  
  model %>% compile(
    optimizer = optimizer_adam(learning_rate = 0.001),
    loss = "mse"
  )
  return(model)
}

# Initialize the competing agents
agent_A <- build_strategic_agent()
agent_B <- build_strategic_agent()

4. Parameterization & Historical Replay Buffer Ingestion

To anchor our agents in empirical reality, we pull 2 years of daily closing prices for Silver futures (SI=F). We also define the microstructural parameters, namely the risk aversion parameter (gamma) and the market impact coefficient (eta), alongside a fixed strategic execution memory window (T = 10).

# 3. STRATEGIC PARAMETERS
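T_horizon <- 10       # Strategic episode length (memory window, in trading days)
gamma_param <- 0.0001 # Risk aversion (declared for reference; not used in the code below)
eta_param <- 0.0005   # Market impact (declared for reference; not used in the code below)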

# 4. HISTORICAL REPLAY DATA (2-Year Training Set)
silver_full <- tq_get("SI=F", from = Sys.Date() - 730) %>%
  filter(!is.na(close)) %>%
  mutate(returns = close / lag(close) - 1) %>%
  drop_na()

# Recent window for the final audit visualization
silver_recent <- tail(silver_full, T_horizon)

5. Dynamic Volatility Corridors

Rather than mapping market behavior against static thresholds, the audit engine computes a volatility-adaptive safety corridor. The boundaries dynamically expand and contract based on the asset’s realized standard deviation (sigma), isolating pure structural noise from intentional strategic maneuvers.

# 5. DYNAMIC SIGMA CORRIDORS
current_sigma <- sd(silver_recent$returns, na.rm = TRUE)
if(is.na(current_sigma)) current_sigma <- 0.01 

analysis_data <- silver_recent %>%
  mutate(
    twap_slope = current_sigma * 1.5, 
    nash_slope = current_sigma * 4.0,
    twap_path = first(close) * (1 - seq(0, first(twap_slope), length.out = n())),
    nash_path = first(close) * (1 - seq(0, first(nash_slope), length.out = n())),
    lower_safety_limit = nash_path * (1 - current_sigma)
  )

6. The Joint Training Replay Engine & Payoff Matrix

This section implements Koulouris & Campajola’s memory hypothesis in code. The two agents iterate over 2 years of rolling historical windows (window_data).

At each step, each agent independently picks the action with the highest predicted Q-value and receives a payoff from a non-cooperative game matrix:

Mutual Cooperation (Action 0, 0): High joint payout (+10) mimicking a stable, supra-competitive margin.
Mutual Aggressive Competition (both agents pick the same non-zero action): Low joint rent (+1), representing the competitive Nash baseline.
Cheating / Under-cutting (actions differ): Asymmetric payoff; the agent with the higher action index receives +5 and the other -5.

# 6. JOINT TRAINING ENGINE (Symmetric Memory Interaction)
message("Joint Training: Agent A & Agent B are learning Silver Market dynamics...")

for(i in 1:(nrow(silver_full) - T_horizon)) {
  window_data <- silver_full[i:(i + T_horizon - 1), ]
  vol <- sd(window_data$returns, na.rm = TRUE)
  if(is.na(vol)) vol <- 0.01
  
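  # Placeholder state: price deviation fixed at 1.0 and time fraction fixed at 0.5;
  # only the window volatility varies during training
  state_vec <- matrix(c(1.0, vol, 0.5), nrow = 1)
  
  # Predict once per agent and reuse the Q-values for action choice and targets
  q_A <- predict(agent_A, state_vec, verbose = 0)
  q_B <- predict(agent_B, state_vec, verbose = 0)
  
  # Greedy action selection (0-based action index)
  act_A <- which.max(q_A) - 1
  act_B <- which.max(q_B) - 1
  
  rewards <- if(act_A == 0 && act_B == 0) {
    list(A = 10, B = 10) 
  } else if(act_A == act_B) {
    list(A = 1, B = 1)   
  } else {
    if(act_A > act_B) list(A = 5, B = -5) else list(A = -5, B = 5) 
  }
  
  # Overwrite only the chosen action's Q-value with the realized payoff
  target_A <- q_A
  target_B <- q_B
  target_A[1, act_A + 1] <- rewards$A
  target_B[1, act_B + 1] <- rewards$B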
  
  agent_A %>% fit(state_vec, target_A, epochs = 1, verbose = 0)
  agent_B %>% fit(state_vec, target_B, epochs = 1, verbose = 0)
}
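The loop above regresses each chosen action's Q-value directly onto the immediate payoff; it does not bootstrap from a next state or maintain a separate target network. For comparison, the target used in a standard Double DQN update can be sketched in base R as follows. The function name, the `discount` argument, and the example Q-values are hypothetical and not part of the original listing.

```r
# Double DQN target for one transition (base R, no keras needed).
# q_online_next / q_target_next: Q-values of the next state from the online and
# target networks; both are hypothetical inputs here.
ddqn_target <- function(reward, q_online_next, q_target_next,
                        discount = 0.99, done = FALSE) {
  if (done) return(reward)                          # terminal step: no bootstrapping
  best_action <- which.max(q_online_next)           # online network selects the action
  reward + discount * q_target_next[best_action]    # target network evaluates it
}

y <- ddqn_target(
  reward        = 1,
  q_online_next = c(0.2, 0.9, 0.4),
  q_target_next = c(0.5, 0.3, 0.8)
)
```

Decoupling action selection from action evaluation in this way is what distinguishes Double DQN from plain DQN, which takes the maximum of the target network's own Q-values.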

7. Post-Convergence Audit Inference & Regime Selection

After the training pass, the engine switches to audit mode. It computes each agent's Q-values for every day in the most recent execution window and then classifies the market regime by comparing the latest close with the TWAP and Nash paths.

# 7. FINAL AUDIT INFERENCE
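analysis_data <- analysis_data %>%
  mutate(step = row_number()) %>%  # computed before rowwise(), where row_number() is always 1
  rowwise() %>%
  mutate(
    # Under rowwise(), list-columns are unwrapped, so state_v, q_A and q_B are plain matrices here
    state_v = list(matrix(c(close / twap_path, current_sigma, (T_horizon - step) / T_horizon), nrow = 1)),
    q_A = list(predict(agent_A, state_v, verbose = 0)),
    q_B = list(predict(agent_B, state_v, verbose = 0)),
    joint_action = (which.max(q_A) + which.max(q_B)) / 2  # mean of the 1-based action indices
  ) %>% ungroup()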

# 8. STATUS LOGIC (Professional Category Selection & Color Alignment)
# Plain if/else is used because case_when() expects vector outputs, not named lists
last_row <- tail(analysis_data, 1)
market_status <- if (last_row$close >= last_row$twap_path) {
  list(
    label = "**COOPERATIVE:** Pareto-Efficient Alignment", 
    bg    = "#E8F8F5",  
    color = "#27AE60"   
  )
} else if (last_row$close >= last_row$nash_path) {
  list(
    label = "**NORMAL:** Competitive Nash Equilibrium", 
    bg    = "#FEF5E7",  
    color = "#E67E22"   
  )
} else {
  list(
    label = "**LIQUIDITY SHOCK:** Strategic Deviation Detected", 
    bg    = "#FDEDEC",  
    color = "#C0392B"   
  )
}

8. High-Fidelity Infographic Layer

To generate a publication-quality static vector infographic, we map our theme directly via ggplot2 and ggtext. By embedding the color palette directly into the HTML subtitle strings and forcing label formatting via scales::percent, we create a clean, high-contrast dashboard visualization.

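# 9. GGPLOT PRODUCTION VISUALIZATION (Static Mode with ggtext Integration)
# actual_cost is used in the caption but is not defined in the listing; as an assumption,
# it is taken here as the percentage price decline over the audit window
actual_cost <- (first(analysis_data$close) - last(analysis_data$close)) / first(analysis_data$close) * 100

ggplot(analysis_data, aes(x = date)) +
  geom_ribbon(aes(ymin = lower_safety_limit, ymax = twap_path), fill = "darkgray", alpha = 0.3) +
  
  geom_line(aes(y = twap_path, color = "TWAP (Cooperative)"), linewidth = 1) +
  geom_line(aes(y = nash_path, color = "Nash (Competitive)"), linewidth = 1) +
  geom_line(aes(y = close, color = "Actual Price"), linewidth = 1.3) +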
  scale_y_continuous(labels = scales::label_currency()) +
  
  geom_richtext(
    aes(x = median(date), y = max(close, twap_path) * 1.02, label = market_status$label),
    fill = market_status$bg, color = market_status$color, size = 4,
    family = "Roboto Slab" 
  ) +
  
  scale_color_manual(
    name = NULL,
    values = c("Actual Price" = "steelblue", "TWAP (Cooperative)" = "#27AE60", "Nash (Competitive)" = "#E67E22")
  ) +
  
  labs(
    title = "Silver Market Strategic Audit Engine",
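    subtitle = paste0(
      "<span style='color:#27AE60;'>─── **Cooperative Zone**</span> | ",
      "<span style='color:#E67E22;'>─── **Competitive Zone**</span> | ",
      "<span style='color:#4682B4;'>─── **Actual Execution**</span><br><br>",
      "**Strategic Corridor** (Supra-Competitive Margin Zone)"
    ),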
    x = NULL, y = NULL,
    caption = glue("Dynamic Sigma: {scales::percent(current_sigma, accuracy = 0.01)} | Shortfall: {round(actual_cost, 2)}%")
  ) +
  
  theme_minimal(base_family = "Roboto Slab") +
  theme(plot.title = element_text(face = "bold", size = 16),
        plot.subtitle = element_markdown(face = "bold"), 
        axis.text = element_text(face = "bold"),
        legend.position = "none")

9. Empirics & Compliance Conclusion

When we run the complete inference loop on our terminal Silver execution window, the strategic narrative clarifies perfectly: Actual Execution (the blue trajectory) tracks downward, bypassing the cooperative upper envelope and adhering directly to the competitive boundaries.

The audit badge cleanly returns a status of NORMAL: Competitive Nash Equilibrium, with the terminal metrics computing the exact execution shortfall at 1.59% as indicated in the chart above. While the agents are technically complex neural networks capable of learning memory patterns, the actual price action during this specific ten-day horizon reflects a highly competitive regime, keeping the execution within standard Nash boundaries rather than shifting into a supra-competitive zone.

For quantitative auditors and systemic risk monitors, this approach signals a paradigm shift. Static threshold tests are blind to multi-agent learning trends. By deploying neural simulation baselines, structural compliance teams can automatically audit execution algorithms, isolating algorithmic alignment from pure market variance.

To leave a comment for the author, please follow the link and comment on their blog: DataGeeek.

Continue reading: A Multi-Agent DDQN Strategic Audit Engine for Silver Markets using Keras/Tensorflow