May 2020

In defence of the 95% CI

May 11, 2020 | R on easystats

TLDR: BayestestR currently uses a 89% threshold by default for Credible Intervals (CI). Should we change that? If so, by what? Join the discussion here. Magical numbers, or conventional thresholds, have bad press in statistics, and there are many of th... [Read more...]

To stratify or not? It might not actually matter…

May 11, 2020 | Keith Goldfeld

Continuing with the theme of exploring small issues that come up in trial design, I recently used simulation to assess the impact of stratifying (or not) in the context of a multi-site Covid-19 trial with a binary outcome. The investigators are concerned that baseline health status will affect the probability ...
[Read more...]

Power Analysis by Data Simulation in R – Part II

May 11, 2020 | R | Julian Quandt

The Power Analysis by simulation in R for really any design - Part II Simulating a between-subjects t-test Simulating a within-subject t-test Using a one-sample t-test approach Using a correlated-samples paired t-test approach Summary: Our first simulations with t-tests Footnotes Click HERE to download the .Rmd file This blog is ...
[Read more...]

Visualizing Big MT Cars with Python plotnine-Part 2

May 11, 2020 | R on Redwall Analytics

# R Libraries
library("reticulate")

knitr::opts_chunk$set(
  fig.width = 15,
  fig.height = 8,
  out.width = '100%')
# Choose Python 3.7 miniconda
reticulate::use_condaenv(
  condaenv = "r-reticulate",
  required = TRUE
  )
# Install Python packages
lapply(c("plotnine"), function(package) {
       conda_install("r-reticulate", package, pip = TRUE)
})
# Python libraries
from datatable import *
import numpy as np
import plotnine as p9 
import re
Introduction In this post, we start out where we left off in Exploring Big MT Cars with Python datatable and plotnine-Part 1. In the chunk below, we load our cleaned up big MT Cars data set in order to be able to refer directly to the variable ...
[Read more...]

The USMS ePostal Over the Last 20+ Years

May 11, 2020 | Swimming + Data Science

In a previous post we discussed the 2020 USMS ePostal results, and hypothesized that declines in average distances swum by older age groups are caused by higher proportions of weaker swimmers participating in older age groups, rather than age based declines. We also mentioned how USMS epostal results are observational data, ...
[Read more...]

In defence of the 95% CI

May 11, 2020 | R on easystats

TLDR: BayestestR currently uses a 89% threshold by default for Credible Intervals (CI). Should we change that? If so, by what? Join the discussion here. Magical numbers, or conventional thresholds, have bad press in statistics, and there are many of...
[Read more...]

Call existing R code through functions

May 11, 2020 | Quantargo Blog

When you write code, functions are your best friends. They can make hard things very easy or provide new functionality in a nice way. Through functions you gain access to all the powerful features R has to offer. Call functions with function names an...
[Read more...]

How to Write a Git Commit Message, in 7 Steps

May 11, 2020 | Paul van der Laken

Version control is an essential tool for any software developer. Hence, any respectable data scientist has to make sure his/her analysis programs and machine learning pipelines are reproducible and maintainable through version control. Often, we use git for version control. If you don’t know what git is yet, ... [Read more...]

covid19italy v0.2.0 is now on CRAN

May 10, 2020 | Rami Krispin

Last week I pushed an update of the covid19italy package to CRAN (v0.2.0). The covid19italy R package provides a tidy format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) pandemic outbreak in Italy. The package includes the following three datasets: italy_total - daily summary of the outbreak on ... [Read more...]

New Package, Pinboardr

May 10, 2020 | Roel M. Hogervorst

I’ve created a new package to interact with pinboard not to be confused with pinterest. I noticed there wasn’t a package yet and the API is fairly clear. So come and check it out {pinboardr} at https://github.com/RMHogervorst/pinboardr I did see a new package to ...
[Read more...]

covid19italy v0.2.0 is now on CRAN

May 10, 2020 | Rami Krispin

Last week I pushed an update of the covid19italy package to CRAN (v0.2.0). The covid19italy R package provides a tidy format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) pandemic outbreak in Italy. The package includes the following three datasets: italy_total - daily summary of the outbreak on ... [Read more...]
1 10 11 12 13 14 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)