A new ggplot2-based implementation of control charts for the #RStats package qcc, and some lessons learned along the way.

Do not believe anything: what artists really do is to hang around all day (Paco de Lucia) Andy Warhol was mathematician. At least, he knew how clustering algorithms work. I am pretty sure of this after doing this experiment. First of all, let me introduce you to the breathtaking Grace Kelly: In my previous post

Quandl is a “wikipedia” for numerical data that allows you to search rapidly through 8 million ready-to-use data sets. At DataCamp we created a free in-browser coding tutorial on how to use the corresponding R package to access Quandl data from within R. As every real world data analyst knows, finding and formatting numerical data for

My last post I talked about using rCharts to create interactive graphics for my interview presentations. They seemed to go over pretty well in my interviews and helped me greatly as I did not need to remember or write down specific numbers to talk about. I use slidy to create my HTML slideshows and there was some...

Seventh Torino R net meeting on 27 Mar 2014, exceptionally hosted at Polo Universitario di Asti, will have three presentations: Processing and analysis methods for DNA methylation array data, Giovanni Fiorito, Complex Systems for Life Sciences, University of Turin; Temporal Dominance of Sensations (TDS) … Continue reading →

A historian, a data scientist, a programmer, a mathematician, and a philosopher discuss the question, how likely it is that a lottery draw (6 out of 49) contains two consecutive numbers. The historian The historian argues that from 1955 up to 2011, there were 5026 lottery draws in Germany, every Saturday, and from 2000 on, two draws every...

R square is a widely used measure of model fitness, in General Linear Models (GLM) it can be interpreted as the percent of variance in the response variable explained by the model. This measure is unitless which makes it useful to compare model between studies in meta-analysis analysis. Generalized Linear Mixed models (GLMM) are extending

e-mails with the latest R posts.

(You will not see this message again.)