Learn how to effectively test your Plumber APIs in R using a two-layer testing strategy that separates business logic from API contracts. [Read more...]
Introduction
The EDAM ontology is used to provide conceptual organization of resources in major initiatives like ELIXIR bio.tools and Galaxy.
Bioconductor’s methods and data are tagged using an ad hoc terminologic hierarchy called biocViews.
In...
Overview
This is an introduction on organizing R projects using source packages (powered by my R package srcpkgs). It is based on notes for a talk I have on 2024-05-27 for the Swiss Institute of Bioinformatics Vital-IT group Analysts meeting.
T... [Read more...]
This is final part of a five part series of related posts on validating R packages.
Other posts in the series are:
Validation Guidelines
Package Popularity
Package Documentation
Code Quality
Maintenance (this post)
At last we come to the final ...
Reposted from the original at: https://blog.stephenturner.us/p/tidy-rag-in-r-with-ragnarRetrieval augmented generation in R using the ragnar package. Demonstration: scraping text from relevant links on a website and using RAG to ask about a unive...
Upon users' query, language learning models like ChatGPT and Deepseek can generate useful programming codes having been trained on many codes and texts available on the Internet. However, at least according to my experience the generated sc...
Join our workshop on From Model to Meaning: How to use the marginaleffects R package to interpret results from statistical or machine learning models, which is a part of our workshops for Ukraine series! Here’s some more info: Title: From Model to Meaning: How to use the marginaleffects R ... [Read more...]
The prevailing opinion is that because countries are exporting more goods to the US than it exports to them, resulting in a trade deficit. In return for their exports, these countries receive US dollars, which they often use to purchase US government bonds and stocks. Over time, this process contributes ...
Benchmarking memory usage in R
Profiling memory in R has never been a trivial task.
In this post, I would like to emphasize that currently popular methods are quite inaccurate and should therefore be used with caution. More importantly, they sh...
A Global Response to a Critical Need 🌍
When we launched the first Data Science for Open WASH Data (ds4owd) course in October 2023, 235 WASH professionals from 46 countries around the world signed up—validating our belief that this community is ...
In the plane to Bengaluru, I read through the book A modern introduction to probability and statistics, by Graham Upton—whose Measuring Animal Abundance I reviewed for CHANCE a while ago—, which is based on the earlier Understanding Statistics, written jointly with Ian Cook. (Not to be confused with A ...
I am something of a Dracula theme fanatic. After first discovering the open-source theme in RStudio, I had to see if it was supported by other applications. To my delight, I found custom Dracula themes for over 400 applications. Applying these th... [Read more...]
Join us for an open developer forum exploring how R and Rust can work together in bioinformatics and Bioconductor development. Organised and hosted by Lluís Revilla (Bioconductor Community Advisory Board), this online session will discuss techn...
When it comes to choosing an IDE to install there are lots of different choices out there, each with different pros and cons. This blog post isn’t about recommending or endorsing any specific IDE, but more about what questions you might want to a...
This is part four of a five part series of related posts on validating R packages.
Other posts in the series are:
Validation Guidelines
Package Popularity
Package Documentation
Code Quality (this post)
Maintenance
In this post, we’ll take...
The left side illustrates standard deviation as the spread of individual data values around the population mean (μ). The right side shows standard error as the variability in sample means (x̄) obtained from repeated sampling. Notice how the S...
We are SO excited to announce some massive news for the data.table community: data.table is now a NumFOCUS Sponsored Project!!!
For those who aren’t familiar, NumFOCUS is an incredible nonprofit organization that supports open source projects....
A monad is a very useful pattern. If you don’t know what it is, then read my
post from a few years ago, watch
this excellent video, and
you can also read J. Carroll’s insightful
post. Once you think you get
the concept, this blog post will ... [Read more...]
From April 23rd to 25th, 2025, we hosted the first Bioconductor microbiome course in Brazil, a three-afternoon workshop held at the Interunits Post-Graduate Program of Bioinformatics at the Federal University of Minas Gerais (UFMG) (link) i...