Blog Archives

A Gentle Introduction to tidymodels

June 18, 2019

Recently, I had the opportunity to showcase tidymodels in workshops and talks. Because of my vantage point as a user, I figured it would be valuable to share what I have learned so far. Let’s begin by framing where tidymodels fits in our analysis projects. The diagram above is based on the R for Data Science book, by Wickham and...

Equal Size kmeans

June 12, 2019

We were recently presented with a problem where the decision maker wanted to understand how their data would naturally group together. The classic technique of k-means clustering was a natural choice; it’s well known, computationally efficient, and implemented in base R via the kmeans() function. Our problem has a slight wrinkle: the decision maker wished to see the data grouped...

reticulate, virtualenv, and Python in Linux

June 9, 2019

Roland Stevenson is a data scientist and consultant who may be reached on LinkedIn. reticulate is an R package that allows us to use Python modules from within RStudio. I recently found this functionality useful while trying to compare the results of different uplift models. Though I did have R’s uplift package producing Qini charts and metrics, I also wanted...

Introducing DeclareDesign, a Platform for Research Design

June 3, 2019

Graeme Blair is an Assistant Professor of Political Science at UCLA. Jasper Cooper is a Postdoctoral Research Associate at the Kahneman-Treisman Center for Behavioral Science and Public Policy at Princeton University. Alexander Coppock is an Assistant Professor of Political Science at Yale University. Macartan Humphreys is a Professor of Political Science at Columbia University and a Director of the research group “Institutions...

April 2019: “Top 40” New CRAN Packages

May 29, 2019

One hundred eighty-seven new packages made it to CRAN in April. Here are my picks for the “Top 40”, organized into ten categories: Biotechnology, Data, Econometrics, Machine Learning, Medicine, Science, Statistics, Time Series, Utilities, and Visualization. Biotechnology: genpwr v1.00 provides functions for power and sample size calculations for genetic association studies, allowing for mis-specification of the model of genetic susceptibility....

Momentum Investing with R

May 28, 2019

After an extended hiatus, Reproducible Finance is back! We’ll celebrate by changing focus a bit and coding up an investment strategy called Momentum. Before we even tiptoe in that direction, please note that this is not intended as investment advice and it’s not intended to be a script that can be implemented for trading. The goal is to explore...

Analysing the HIV pandemic, Part 4: Classification of lab samples

May 22, 2019

Andrie de Vries is the author of “R for Dummies” and a Solutions Engineer at RStudio. Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein, South Africa. In this post we complete our series on analysing the HIV pandemic in Africa. Previously we...

Analysing the HIV pandemic, Part 3: Genetic diversity

May 15, 2019

Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein, South Africa. Andrie de Vries is the author of “R for Dummies” and a Solutions Engineer at RStudio. Recap: In part 2 of this series, we discussed the PhyloPi pipeline for conducting routine HIV phylogenetics...

Virtual Morel Foraging with R

May 12, 2019

Bryan Lewis is a mathematician, R developer and mushroom forager. (Morchella Americana by Bryan W. Lewis; see https://ohiomushroomsociety.wordpress.com/) It’s that time of year again, when people in the Midwestern US go nuts for morel mushrooms. Although fairly common in Western Pennsylvania, Ohio, Indiana, Illinois, Wisconsin, and, especially, Michigan, they can still be tricky to find due to the vagaries of weather and mysteries...

Analysing the HIV pandemic, Part 2: Drug resistance testing

May 6, 2019

Phillip (Armand) Bester is a medical scientist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein, South Africa. Dominique Goedhals is a pathologist, researcher, and lecturer at the Division of Virology, University of the Free State, and National Health Laboratory Service (NHLS), Bloemfontein, South Africa. Andrie de Vries is the...
