Getting back Into It
It’s been a while for this website; a lot of my work has been used on projects that can’t be easily shared here. That said, I have been at the Edinburgh Fringe this month supporting my wife, and I wanted to use a Tidy ...
Cloud providers are no longer just offering traditional x86-based servers,
ARM-based servers are now becoming a serious alternative. And for good reason:
they’re often far more cost-effective and power-efficient than their x86
counterparts. For ...
1 Introduction
Data preprocessing is a cornerstone of any data analysis or machine learning pipeline. Raw data rarely comes in a form ready for direct analysis — it often requires cleaning, transformation, normalization, and careful handl...
Motivation
Epidemiological delays inform about the time between two well-defined events related to a disease. The serial interval (SI) of an infectious disease is defined as the time between symptom onset in a primary case (infector) and symptom onset in a secondary case (infectee). It is a widely used epidemiological ...
1 Introduction
Missing data is one of the most common challenges in data analysis and statistical modeling.
Whether the data originates from surveys, administrative registers, or clinical trials, it is almost inevitable that some values are abs...
Bernstein has conducted an analysis of the U.S. supply and demand balance in analog and discrete semiconductors, particularly in light of the potential introduction of Section 232 tariffs. The analysis focuses on the implications for major companies, including Texas Instruments, Analog Devices, Infineon Technologies and Renesas. According to analysts led ...
This time, I will do an absurd amount of useless conversions 🙂 Some might make some sense – in the long looong run, but most will for sure have none. It all started with coffee 🙂 and the initial question…Read more ›
Wikipedia donations style note: Because of delays with my scholarship payment, if this post is useful to you I kindly ask a minimal donation on Buy Me a Coffee. It shall be used to continue my Open Source efforts. The full explanation is here: A ...
Back in 2018, there was a survey on Gallup, about honesty and ethical standards, per profession More than four in five Americans (84%) again rate the honesty and ethical standards of nurses as “very high” or “high,” earning them the top spot among a diverse list of professions for the 17th consecutive ...
Most R training courses follow a standardized approach that may not align with your actual work requirements. This 1:1 training program addresses that gap by building each session around your specific datasets, questions, and objectives.
The R or... [Read more...]
Instead of flashcards, we Rube Goldberg’d this with Bioconductor! Analyzed 3,280 E. coli genomes from NCBI, detecting ESBL genes in 84.4% of samples. CTX-M-15 was most common. Helped us understand gene nomenclature and sequence analysis! 📊🔬
Motivation
I’ve always had a hard time learning and remembering all these genes for antimicrobial ...
Introduction
When teaching, for my practicals/tutorials and for about half of my lectures I find myself preparing them using R Markdown and laterly Quarto. I enjoy preparing the material in R Markdown and Quarto because it gives me a reproducible way o...
Below are the slides for my Futureverse P2P: Peer-to-Peer Parallelization in R talk that I presented at the useR! 2025 conference at Duke University, Durham, North Carolina, United States.
Title: Futureverse P2P: Peer-to-Peer Parallelization i...
Author’s introduction
This post grew out of a rambling, sporadically multi-month play with a bunch of data that became far too big for a single post for any plausible audience. So I have broken that work into three artefacts that might be of interest ...
We just arrived in Kyoto (京都), Japan, from Montréal, Canada. Everything seems very different. I still have in mind the time I spent in Hong Kong (more than a year), but that was 25 years ago… Just to compare, I used wikipedia’s page of Kyoto vs. Montréal. Quite naturaly, ...
We are thrilled to announce the release of {rtflite} 1.0.0, marking a significant milestone in bringing production-ready TLF generation capabilities in RTF format to Python for clinical trial reporting. This major release represents our commit...