Exploring correlations in R with corrr

@drsimonj here to share a (sort of) readable version of my presentation at the amst-R-dam meetup on 14 August, 2018: “Exploring correlations in R with corrr”. Those who attended will know that I changed the topic of the talk, originally advertised as “R from academia to commercial … Continue reading

Guide to tidy git analysis

@drsimonj here to help you embark on git repo analyses! Ever wondered who contributes to git repos? How their contributions have changed over time? What sort of conventions different authors use in their commit messages? Maybe you were inspired by Mara Averick to contribute to tidyverse packages … Continue reading

Five tips to improve your R code

@drsimonj here with five simple tricks I find myself sharing all the time with fellow R users to improve their code! This post was originally published on DataCamp’s community as one of their top 10 articles in 2017. 1. More fun to sequence from 1: Next time you use the colon operator to … Continue reading
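The excerpt is cut off before the tip itself, so the following is only a guess at the kind of issue it has in mind: a minimal sketch, assuming the point is that `1:length(x)` misbehaves when `x` is empty, and that base R's `seq_along()` is the safer choice. The variable names are illustrative.

```r
x <- c()  # an empty vector (NULL), so length(x) is 0

# 1:length(x) becomes 1:0, which counts DOWN from 1 to 0.
# A for loop over it would run twice instead of zero times.
bad <- 1:length(x)

# seq_along(x) returns an empty integer vector for empty input,
# so a for loop over it would not run at all.
good <- seq_along(x)

print(bad)   # [1] 1 0
print(good)  # integer(0)
```

`seq_len(length(x))` behaves the same way as `seq_along(x)` here.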

Big Data Solutions: A/B t test

@drsimonj here to share my code for using Welch’s t-test to compare group means using summary statistics. Motivation: I’ve just started working with A/B tests that use big data. Where once I’d whimsically run t.test(), now my data won’t fit into memory! I’m … Continue reading
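The post's own code is not shown in this excerpt. As a rough sketch of the idea, Welch's t-test needs only each group's mean, standard deviation and size, so it can be computed after the raw data has been summarised elsewhere. The helper name `welch_from_summary` and its arguments are hypothetical, not taken from the post.

```r
# Welch's t-test from summary statistics only (hypothetical helper).
# m = group mean, s = group standard deviation, n = group size.
welch_from_summary <- function(m1, s1, n1, m2, s2, n2) {
  v1 <- s1^2 / n1  # squared standard error of mean 1
  v2 <- s2^2 / n2  # squared standard error of mean 2
  t_stat <- (m1 - m2) / sqrt(v1 + v2)
  # Welch-Satterthwaite approximation to the degrees of freedom
  df <- (v1 + v2)^2 / (v1^2 / (n1 - 1) + v2^2 / (n2 - 1))
  p <- 2 * pt(-abs(t_stat), df)  # two-sided p-value
  c(t = t_stat, df = df, p.value = p)
}

# Check against t.test() on simulated data small enough to fit in memory.
set.seed(1)
a <- rnorm(500, mean = 0)
b <- rnorm(400, mean = 0.2, sd = 1.5)

res <- welch_from_summary(mean(a), sd(a), length(a),
                          mean(b), sd(b), length(b))
ref <- t.test(a, b)  # Welch is the default (var.equal = FALSE)
print(res)
```

In a big data setting, the six inputs would come from an aggregation query rather than from vectors held in memory.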

A tidy model pipeline with twidlr and broom

@drsimonj here to show you how to go from data in a data.frame to a tidy data.frame of model output by combining twidlr and broom in a single, tidy model pipeline. The problem: Different model functions take different types of inputs (data.frames, matrices, etc.) and produce different types of … Continue reading

Pretty scatter plots with ggplot2

@drsimonj here to make pretty scatter plots of correlated variables with ggplot2! We’ll learn how to create plots that look like this: Data: In a data.frame d, we’ll simulate two correlated variables a and b of length n:
set.seed(170513)
n <- 200
d <- data.frame(a = … Continue reading
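The excerpt's simulation code is truncated, so the following is not the post's code: it is one common way to simulate two correlated variables and draw a basic scatter plot. The target correlation `r`, the mixing formula, and the plot styling are all assumptions. The seed and `n` are taken from the excerpt.

```r
set.seed(170513)  # seed shown in the excerpt
n <- 200          # sample size shown in the excerpt
r <- 0.7          # target correlation: an assumed value, not from the post

# Mix a with independent noise so that cor(a, b) is approximately r.
a <- rnorm(n)
b <- r * a + sqrt(1 - r^2) * rnorm(n)
d <- data.frame(a = a, b = b)

print(cor(d$a, d$b))

# Plot only if ggplot2 is installed.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(d, aes(x = a, y = b)) +
    geom_point(alpha = 0.6) +
    theme_minimal()
  print(p)
}
```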

Pretty histograms with ggplot2

@drsimonj here to make pretty histograms with ggplot2! In this post you’ll learn how to create histograms like this: The data: Let’s simulate data for a continuous variable x in a data frame d:
set.seed(070510)
d <- data.frame(x = rnorm(2000))
head(d)
#> … Continue reading
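As a starting point for the kind of plot the post builds towards, here is a minimal sketch that reuses the simulation from the excerpt and draws a plain histogram. The bin count and colours are assumptions, not the post's styling.

```r
set.seed(070510)  # as in the excerpt; R reads this as the number 70510
d <- data.frame(x = rnorm(2000))

# Plot only if ggplot2 is installed.
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(d, aes(x = x)) +
    geom_histogram(bins = 30, fill = "steelblue", colour = "white")
  print(p)
}
```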