Le Monde puzzle [#1132]

February 23, 2020
By
Le Monde puzzle [#1132]

A vaguely arithmetic challenge as Le weekly Monde current mathematical puzzle: Given two boxes containing x and 2N+1-x balls respectively. If one proceeds by repeatedly transferring half the balls from the even box to the odd box, what is the largest value of N for which the resulting sequence in one of the boxes covers

Read more »

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

February 22, 2020
By
Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

Intro: the need for microdata, and the risk of disclosure Survey and administrative data are essential for scientific research, however accessing such datasets can be very tricky, or even impossible. In...

Read more »

The significance of the sector on the salary in Sweden, a comparison between different occupational groups, part 2

February 22, 2020
By
The significance of the sector on the salary in Sweden, a comparison between different occupational groups, part 2

In my last post, I examined the significance of the sector on the salary for different occupational groups using statistics from different regions. In previous posts I have shown...

Read more »

digest 0.6.25: Spookyhash bugfix

February 22, 2020
By

And a new version of digest is getting onto CRAN now, and to Debian shortly. digest creates hash digests of arbitrary R objects (using the md5, sha-1, sha-256, sha-512,...

Read more »

Nifty Upcoming Enhancements to unpack/to

February 22, 2020
By

We have some really nifty upcoming enhancements to wrapr unpack/to. One of the new notations is the use of := as an alternate assignment operator for unpack/to. This lets...

Read more »

Body Mass Index by @ellis2013nz

February 22, 2020
By
Body Mass Index by @ellis2013nz

BMI has an expectations management problem Body Mass Index (BMI) is an attempt to give a quick back-of-envelope answer to the question “if someone weighs W kg, is that a...

Read more »

RcppSimdJson 0.0.2: First Update!

February 22, 2020
By

Following up on the initial RcppSimdJson release, a first updated arrived on CRAN yesterday. RcppSimdJson wraps the fantastic simdjson library by Daniel Lemire which truly impressive. Via some very...

Read more »

R is turning 20 years old next Saturday. Here is how much bigger, stronger and faster it got over the years

February 22, 2020
By
R is turning 20 years old next Saturday. Here is how much bigger, stronger and faster it got over the years

Introduction It is almost the 29th of February 2020! A day that is very interesting for R, because it marks 20 years from the release of R v1.0.0, the first...

Read more »

relgam: Fitting reluctant generalized additive models

February 22, 2020
By
relgam: Fitting reluctant generalized additive models

I’m proud to announce that my latest research project, reluctant generalized additive modeling (RGAM), is complete (for now)! In this post, I give a brief overview of the method:...

Read more »

The next package release into AWS Athena

February 21, 2020
By
The next package release into AWS Athena

RBloggers|RBloggers-feedburner RAthena 1.7.1 and noctua 1.5.1 package versions have now been released to the CRAN. They both bring along several improvements with the connection to AWS Athena, noticeably the performance...

Read more »

Correlogram in R: how to highlight the most correlated variables in a dataset

February 21, 2020
By
Correlogram in R: how to highlight the most correlated variables in a dataset

Introduction Correlation matrix Correlogram Correlation test Code Photo by Pritesh Sudra Introduction Correlation, often computed as part of descriptive statistics, is a statistical tool used to study the relationship between two variables, ...

Read more »

Survey: What Degree is Best for Data Science?

February 21, 2020
By
Survey: What Degree is Best for Data Science?

  TL;DRJust answer 4 questions about best degree for Data Science here: https://www.surveymonkey.com/r/7FGGWS7 No doubt asking the question "What's the best degree for Data Science?" one won't expect unified or...

Read more »

BIMI Up, Scotty! A look at Brand Indicators for Message Identification (BIMI) Adoption with R and the Alexa Top 1m

February 21, 2020
By
BIMI Up, Scotty! A look at Brand Indicators for Message Identification (BIMI) Adoption with R and the Alexa Top 1m

It seems that the need for MX, DKIM, SPF, and DMARC records for modern email setups were just not enough acronyms (and setup tasks) for some folks, resulting in...

Read more »

R Community Explorer – Google Summer of Code Projects

February 21, 2020
By
R Community Explorer – Google Summer of Code Projects

By Benaiah Ubah, Claudia Vitolo and Rick Pack Introduction Google Summer of Code (GSoC) is an annual 3-month open-source software development (coding) program that provides a platform for mentors...

Read more »

Illuminating the Illuminated – Part Four: Tempora Mutantur | Changepoint Analysis of the Voynich Manuscript

February 21, 2020
By
Illuminating the Illuminated – Part Four: Tempora Mutantur | Changepoint Analysis of the Voynich Manuscript

Our past interrogation of the Voynich Manuscript has deconstructed its esoteric symbols into a form more suitable for our ends, subjected its statistical properties to comparison with more mundane...

Read more »

Tidy Discounted Cash Flow Analysis in R (for Company Valuation)

February 20, 2020
By
Tidy Discounted Cash Flow Analysis in R (for Company Valuation)

The tidy data principles are a cornerstone of financial data management and the data modeling workflow. The foundation for tidy data management is the tidyverse, a collection of R...

Read more »

rOpenSci’s Leadership in #rstats Culture

rOpenSci’s Leadership in #rstats Culture

At their closing keynote at the 2020 RStudio Conference, Hilary Parker and Roger Peng mentioned that they hatched the idea for their excellent Not So Standard Deviations podcast following...

Read more »

Rebalancing! Really?

February 20, 2020
By

In our last post, we introduced benchmarking as a way to analyze our hero’s investment results apart from comparing it to alternate weightings or Sharpe ratios. In this case,...

Read more »

A classification approach to predicting air crash survival

February 20, 2020
By
A classification approach to predicting air crash survival

Introduction Historically there have been several instance of air plane crashes. This study is an attempt to explore the possible causes of such air crashes, and to determine if air...

Read more »

Analysing tweets with the rtweet package

Analysing tweets with the rtweet package

This is a brief post on collecting and analysing tweets. I will show how to use the rtweet package to extract Twitter posts about the R community. This ties...

Read more »

DALEX v 1.0 and the Explanatory Model Analysis

February 20, 2020
By
DALEX v 1.0 and the Explanatory Model Analysis

The DALEX package version 1.0 CRAN release is scheduled for Feb 20. It brings lots of improvements and changes. Below I will briefly summarise how this package helps to...

Read more »

EARL Conference 2020 – Why YOU should Submit an Abstract

February 20, 2020
By

In 2014 we launched the EARL (Enterprise Application of the R Language) Conference aimed at connecting and inspiring business users... The post EARL Conference 2020 – Why YOU should Submit...

Read more »

Testing for a causal effect (with 2 time series)

February 19, 2020
By
Testing for a causal effect (with 2 time series)

A few days ago, I came back on a sentence I found (in a French newspaper), where someone was claiming that “… an old variable explains 85% of the...

Read more »

Aligning 2D NMR Spectra Part 1

Aligning 2D NMR Spectra Part 1

In one-dimensional \(^1\)H NMR spectroscopy, particularly biomolecular NMR, it is frequently necessary to align spectra before chemometric or metabolomics analysis. Poor alignment arises largely from pH and ionic strength...

Read more »

2 Months in 2 Minutes – rOpenSci News, February 2020

2 Months in 2 Minutes – rOpenSci News, February 2020

rOpenSci HQ On behalf of rOpenSci, thank you to everyone who has contributed their...

Read more »

Debugging: Signals and Subprocesses

February 19, 2020
By

This is a short story about a non-trivial bug in the processx package, and how I fixed it. It is a good showcase of the some debugging tools. The bug processx is...

Read more »

Circular regression trees and forests

February 19, 2020
By
Circular regression trees and forests

A flexible framework for probabilistic forecasting of circular data is introduced, using distributional regression trees and random forests based on the...

Read more »

Dynamic UI Elements in Shiny – Part 2

February 19, 2020
By
Dynamic UI Elements in Shiny – Part 2

Continuing our effort of applying the principles of reactivity to the UI part of a ShinyApp, this blog introduces two ways of conditionally rendering UI-elements in your app. Both...

Read more »

Data science trainings in Berlin & Hamburg

February 19, 2020
By
Data science trainings in Berlin & Hamburg

R is one of the leading programming languages for data analysis. In April and October 2020 we will bring our popular trainings “Introduction to R“ and “Machine Learning with...

Read more »

Search R-bloggers

Sponsors