Six Sigma DMAIC Series in R – Part 3

July 15, 2018
By
Six Sigma DMAIC Series in R – Part 3

Hope you liked the Part 1 and Part 2 of this Series. In this Part 3, we will go through the tools used during the analyze phase of Six Sigma DMAIC cycle. In this phase, available data is used to identify the key process inputs and their relation to the output. We will go through Related PostPrediction Interval, the...

Read more »

A thought experiment: How CRAN saved 3,620 (working) lives

July 15, 2018
By

Given the vast amount of R packages available today, it makes sense (at least to me, as a trained economist) to ask a simple yet difficult question: How much...

Read more »

Alleviating AWS Athena Aggravation with Asynchronous Assistance

July 14, 2018
By

I’ve blogged about how to use Amazon Athena with R before and if you are a regular Athena user, you’ve likely run into a situation where you prepare a...

Read more »

[Web2Day] Producing web content with R

July 13, 2018
By
[Web2Day] Producing web content with R

Earlier this week, my talk at the Web2Day conference was put online. Here is an english summary for those who don’t understand french :) Disclaimer: this talk has been...

Read more »

Why I rarely use apply

July 13, 2018
By
Why I rarely use apply

In this short post, I talk about why I’m moving away from using function apply. With matrices It’s okay to use apply with a dense matrix, although you can...

Read more »

Seasonal decomposition of short time series

July 13, 2018
By

Many users have tried to do a seasonal decomposition with a short time series, and hit the error “Series has less than two periods”. The problem is that the usual...

Read more »

Variable vs. Participant-wise Standardization

July 13, 2018
By
Variable vs. Participant-wise Standardization

The data Standardize Effect of Standardization At a general level At a participant level ...

Read more »

From Data to Viz | Find the graphic you need

July 13, 2018
By
From Data to Viz | Find the graphic you need

Read more »

Stencila – an office suite for reproducible research

July 13, 2018
By
Stencila – an office suite for reproducible research

Stencila launches the first version of its (open source) word processor and spreadsheet editor designed for researchers. By Michael Aufreiter, Substance, and Aleksandra Pawlik and Nokome Bentley, Stencila Stencila...

Read more »

Is GINA really about to die?!?

July 13, 2018
By
Is GINA really about to die?!?

Introduction During a recent negotiation of an informed consent form for use in a clinical trial, the opposing lawyer and I skirmished over the applicability of the Genetic Information...

Read more »

Introducing the Kernelheaping Package II

July 13, 2018
By
Introducing the Kernelheaping Package II

In the first part of Introducing the Kernelheaping Package I showed how to compute and plot kernel density estimates on rounded or interval censored data using the Kernelheaping package....

Read more »

Using Machine Learning for Causal Inference

July 13, 2018
By
Using Machine Learning for Causal Inference

Machine Learning (ML) is still an underdog in the field of economics. However, it gets more and more recognition in the recent years. One reason for being an underdog...

Read more »

Variational Gaussian Mixtures for Face Detection

July 12, 2018
By
Variational Gaussian Mixtures for Face Detection

Mixture model A Gaussian mixture model is a probabilistic way of representing subpopulations within an overall population. We only observe the data, not the subpopulation from which observation belongs. We have $N$ random...

Read more »

LIBD rstats club remote useR!2018 notes

LIBD rstats club remote useR!2018 notes

For our July 13th 2018 LIBD rstats club meeting we decided to check as much as we could the useR!2018 conference. Here’s what we were able to figure out...

Read more »

Announcing the R Markdown Book

July 12, 2018
By
Announcing the R Markdown Book

It is exciting for us to see the amazing growth of the R Markdown ecosystem over the four years since the creation of R Markdown in 2014. Now you...

Read more »

How to Aggregate Data in R

July 12, 2018
By

The process involves two stages. First, collate individual cases of raw data together with a grouping variable. Second, perform which calculation you want on each...

Read more »

New open data sets from Microsoft Research

July 12, 2018
By

Microsoft has released a number of data sets produced by Microsoft Research and made them available for download at Microsoft Research Open Data. The Datasets in Microsoft Research Open...

Read more »

Coloured output in the R console

July 12, 2018
By
Coloured output in the R console

Just a little fun today… the R console isn’t the most interesting of things… text is typically either black or red (assuming default settings in RStudio). There’s a package...

Read more »

New Version of Choroplethr.com!

July 12, 2018
By
New Version of Choroplethr.com!

Last year I decided to take out the domain Choroplethr.com. I used it to host information about Choroplethr, my suite of R packages for mapping... The post New Version of...

Read more »

Where do I learn about log_sum_exp, log1p, lccdf, and other numerical analysis tricks?

July 12, 2018
By

Richard McElreath inquires: I was helping a colleague recently fix his MATLAB code by using log_sum_exp and log1m tricks. The natural question he had was, “where do you learn...

Read more »

The Mathematics Genealogy Project: Customizing my mathematical family tree

July 12, 2018
By
The Mathematics Genealogy Project: Customizing my mathematical family tree

Some time ago, Maëlle Salmon published a very nice post showing how she scraped her mathematical family tree from the Mathematics Genealogy Project. Of course I immediately wanted to...

Read more »

Finalfit now includes bootstrap simulation for model prediction

July 12, 2018
By
Finalfit now includes bootstrap simulation for model prediction

If your new to modelling in R and don’t know what this title means, you definitely want to look into doing it. I’ve always been a fan of converting...

Read more »

My upcoming conference talks & workshops: M-cubed, ML Summit & data2day

July 11, 2018
By
My upcoming conference talks & workshops: M-cubed, ML Summit & data2day

I’ll be giving talks and workshops at the following three upcoming conferences; hope to meet some of you there! From 15th to 17th October 2018, I’ll be in London for...

Read more »

phylogram: dendrograms for evolutionary analysis

phylogram: dendrograms for evolutionary analysis

Evolutionary biologists are increasingly using R for building, editing and visualizing phylogenetic trees. The reproducible code-based workflow and comprehensive array of tools available in packages such as ape, phangorn and phytools make R an...

Read more »

Four Ways to Write Better Stan Code

July 11, 2018
By
Four Ways to Write Better Stan Code

1. Improve sampler efficiency by picking the right model We need to address how we specify our models before even discussing writing code that is...

Read more »

John Mount speaking on rquery and rqdatatable

July 11, 2018
By
John Mount speaking on rquery and rqdatatable

rquery and rqdatatable are new R packages for data wrangling; either at scale (in databases, or big data systems such as Apache Spark), or in-memory. The speed up both...

Read more »

New Course: Python for R Users

July 11, 2018
By
New Course: Python for R Users

Here is the course link. Course Description Python and R have seen immense growth in popularity in the "Machine Learning Age". They both are high-level languages that are easy to learn...

Read more »

[ggplot2] Welcome viridis !

July 11, 2018
By
[ggplot2] Welcome viridis !

Let’s welcome the viridis palette into the new version of {ggplot2}! Viri-what ? viridis is one of the favorite color palettes of one of the member of the team...

Read more »

Data Science For Business: 3 Reasons You Need To Learn The Expected Value Framework

Data Science For Business: 3 Reasons You Need To Learn The Expected Value Framework

One of the most difficult and most critical parts of implementing data science in business is quantifying the return-on-investment or ROI. As a data scientist in an organization, it’s...

Read more »

Search R-bloggers


Sponsors

Mango solutions





mckinsey.com global hackathon

Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC2 west

ODSC1_london

datasociety

http://www.eoda.de

max kuhn









Six Sigma Online Training



mljar.com

computationalanalytics.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.