The p-direction: A Bayesian equivalent of the p-value?

February 25, 2020
By
The p-direction: A Bayesian equivalent of the p-value?

The Bayesian framework is powerful and allows for an incredible amount of flexibility and control over your analysis. That being said, newcomers often struggle with a lot of new concepts and tools and could benefit from some familiar grounding. And the p-value is a very familiar index (although paradoxically often misunderstood, but that’s another topic). Is there an equivalent of...

Read more »

New xgboost defaults

February 25, 2020
By
New xgboost defaults

xgboost is the most famous R package for gradient boosting and it is since long time on the market. In one of my publications, I created a framework for...

Read more »

3 recommended books on learning R

February 24, 2020
By
3 recommended books on learning R

I sometimes get asked how I got started learning R. I thought I would use this post to go through a few books I read along the way which...

Read more »

R Robustreg Package Downloads

February 24, 2020
By
R Robustreg Package Downloads

I built robustreg in 2006 and at the time the major stat packages did not have a robust regression available.  Below are graphs of weekly and cumulative downloads from...

Read more »

RStudio 1.3 Preview: Integrated Tutorials

February 24, 2020
By
RStudio 1.3 Preview: Integrated Tutorials

This blog post is part of a series on new features in RStudio 1.3, currently available as a preview release. We’re excited to announce that RStudio v1.3 will gain a...

Read more »

opentripplanner: Fast and Easy Multimodal Trip Planning in R with OpenTripPlanner

With services like Google Maps, finding the fastest route from A to B has become quick, cheap, and easy. Not just for driving but walking, cycling and public transport...

Read more »

multiplying the bars

February 24, 2020
By

The latest Riddler makes the remark that the expression |-1|-2|-3| has no unique meaning (and hence value) since it could be | -1x|-2|-3 | = 5   or   |-1| –...

Read more »

January 2020: “Top 40” New R Packages

February 23, 2020
By
January 2020: “Top 40” New R Packages

One hundred forty-seven new packages made it to CRAN in January. Here are my “Top 40” picks in nine categories: Computational Methods, Genomics, Machine Learning, Mathematics, Medicine, Statistics, Time...

Read more »

Le Monde puzzle [#1132]

February 23, 2020
By
Le Monde puzzle [#1132]

A vaguely arithmetic challenge as Le weekly Monde current mathematical puzzle: Given two boxes containing x and 2N+1-x balls respectively. If one proceeds by repeatedly transferring half the balls...

Read more »

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

February 22, 2020
By
Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

Intro: the need for microdata, and the risk of disclosure Survey and administrative data are essential for scientific research, however accessing such datasets can be very tricky, or even impossible. In...

Read more »

The significance of the sector on the salary in Sweden, a comparison between different occupational groups, part 2

February 22, 2020
By
The significance of the sector on the salary in Sweden, a comparison between different occupational groups, part 2

In my last post, I examined the significance of the sector on the salary for different occupational groups using statistics from different regions. In previous posts I have shown...

Read more »

digest 0.6.25: Spookyhash bugfix

February 22, 2020
By

And a new version of digest is getting onto CRAN now, and to Debian shortly. digest creates hash digests of arbitrary R objects (using the md5, sha-1, sha-256, sha-512,...

Read more »

Nifty Upcoming Enhancements to unpack/to

February 22, 2020
By

We have some really nifty upcoming enhancements to wrapr unpack/to. One of the new notations is the use of := as an alternate assignment operator for unpack/to. This lets...

Read more »

Body Mass Index by @ellis2013nz

February 22, 2020
By
Body Mass Index by @ellis2013nz

BMI has an expectations management problem Body Mass Index (BMI) is an attempt to give a quick back-of-envelope answer to the question “if someone weighs W kg, is that a...

Read more »

RcppSimdJson 0.0.2: First Update!

February 22, 2020
By

Following up on the initial RcppSimdJson release, a first updated arrived on CRAN yesterday. RcppSimdJson wraps the fantastic simdjson library by Daniel Lemire which truly impressive. Via some very...

Read more »

R is turning 20 years old next Saturday. Here is how much bigger, stronger and faster it got over the years

February 22, 2020
By
R is turning 20 years old next Saturday. Here is how much bigger, stronger and faster it got over the years

Introduction It is almost the 29th of February 2020! A day that is very interesting for R, because it marks 20 years from the release of R v1.0.0, the first...

Read more »

relgam: Fitting reluctant generalized additive models

February 22, 2020
By
relgam: Fitting reluctant generalized additive models

I’m proud to announce that my latest research project, reluctant generalized additive modeling (RGAM), is complete (for now)! In this post, I give a brief overview of the method:...

Read more »

The next package release into AWS Athena

February 21, 2020
By
The next package release into AWS Athena

RBloggers|RBloggers-feedburner RAthena 1.7.1 and noctua 1.5.1 package versions have now been released to the CRAN. They both bring along several improvements with the connection to AWS Athena, noticeably the performance...

Read more »

Correlogram in R: how to highlight the most correlated variables in a dataset

February 21, 2020
By
Correlogram in R: how to highlight the most correlated variables in a dataset

Introduction Correlation matrix Correlogram Correlation test Code Photo by Pritesh Sudra Introduction Correlation, often computed as part of descriptive statistics, is a statistical tool used to study the relationship between two variables, ...

Read more »

Survey: What Degree is Best for Data Science?

February 21, 2020
By
Survey: What Degree is Best for Data Science?

  TL;DRJust answer 4 questions about best degree for Data Science here: https://www.surveymonkey.com/r/7FGGWS7 No doubt asking the question "What's the best degree for Data Science?" one won't expect unified or...

Read more »

BIMI Up, Scotty! A look at Brand Indicators for Message Identification (BIMI) Adoption with R and the Alexa Top 1m

February 21, 2020
By
BIMI Up, Scotty! A look at Brand Indicators for Message Identification (BIMI) Adoption with R and the Alexa Top 1m

It seems that the need for MX, DKIM, SPF, and DMARC records for modern email setups were just not enough acronyms (and setup tasks) for some folks, resulting in...

Read more »

R Community Explorer – Google Summer of Code Projects

February 21, 2020
By
R Community Explorer – Google Summer of Code Projects

By Benaiah Ubah, Claudia Vitolo and Rick Pack Introduction Google Summer of Code (GSoC) is an annual 3-month open-source software development (coding) program that provides a platform for mentors...

Read more »

Illuminating the Illuminated – Part Four: Tempora Mutantur | Changepoint Analysis of the Voynich Manuscript

February 21, 2020
By
Illuminating the Illuminated – Part Four: Tempora Mutantur | Changepoint Analysis of the Voynich Manuscript

Our past interrogation of the Voynich Manuscript has deconstructed its esoteric symbols into a form more suitable for our ends, subjected its statistical properties to comparison with more mundane...

Read more »

Tidy Discounted Cash Flow Analysis in R (for Company Valuation)

February 20, 2020
By
Tidy Discounted Cash Flow Analysis in R (for Company Valuation)

The tidy data principles are a cornerstone of financial data management and the data modeling workflow. The foundation for tidy data management is the tidyverse, a collection of R...

Read more »

rOpenSci’s Leadership in #rstats Culture

rOpenSci’s Leadership in #rstats Culture

At their closing keynote at the 2020 RStudio Conference, Hilary Parker and Roger Peng mentioned that they hatched the idea for their excellent Not So Standard Deviations podcast following...

Read more »

Rebalancing! Really?

February 20, 2020
By

In our last post, we introduced benchmarking as a way to analyze our hero’s investment results apart from comparing it to alternate weightings or Sharpe ratios. In this case,...

Read more »

A classification approach to predicting air crash survival

February 20, 2020
By
A classification approach to predicting air crash survival

Introduction Historically there have been several instance of air plane crashes. This study is an attempt to explore the possible causes of such air crashes, and to determine if air...

Read more »

Analysing tweets with the rtweet package

Analysing tweets with the rtweet package

This is a brief post on collecting and analysing tweets. I will show how to use the rtweet package to extract Twitter posts about the R community. This ties...

Read more »

DALEX v 1.0 and the Explanatory Model Analysis

February 20, 2020
By
DALEX v 1.0 and the Explanatory Model Analysis

The DALEX package version 1.0 CRAN release is scheduled for Feb 20. It brings lots of improvements and changes. Below I will briefly summarise how this package helps to...

Read more »

Search R-bloggers

Sponsors