Decision making trees and machine learning resources for R

April 30, 2014
I have recently come across Ricky Ho's blog "Pragmatic Programming Techniques", which seems to be an excellent resource for all sorts of aspects of data exploration and predictive modelling. The post "Six steps in data science" provides a nice overview of some of the topics covered in the blog. For some reason, this blog does not seem to be...

Mythbusting – Dr. Copper

April 21, 2014
Image by Justin Reznick. “An economist is an expert who will know tomorrow why the things he predicted yesterday didn't happen today.” Laurence J. Peter (author and creator of the Peter Principle). If you were paying attention to financial sites last month, you probably noticed a number of articles on “Dr. Copper”. Here is

Geomorph 3D Visualization

April 16, 2014
Dear geomorph users, version 2.0 of geomorph brings new developments in how shape deformations from 3D coordinate shape data can be viewed. We have implemented warping of 3D surface files (e.g., .ply files), which allows the user to visualize the shape deformations along Principal Component axes, Multivariate Regression slopes, Partial Least Squares axes and group differences, to name a few. The new function warpRefMesh() reads in a .ply...

Visualizing principal components with R and Sochi Olympic Athletes

March 27, 2014
Principal Components Analysis (PCA) is used as a dimensionality reduction method. Here we simply explain PCA step-by-step using data about Sochi Olympic Curlers. It is hard to visualize a high dimensional space. When I took linear algebra, the book and teachers spoke about it as if it were easy to visualize a hyperspace, but...
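As a rough illustration of the step-by-step idea, the sketch below runs PCA on two variables in plain Python. It is not the post's own R code: the function name pca_2d and the height/weight numbers are made up for illustration and are not the Sochi curler data. With only two variables the covariance matrix is 2x2, so its eigenvalues and the direction of the first component can be written in closed form.

```python
import math

def pca_2d(points):
    """PCA for 2-D data: returns eigenvalues (largest first),
    the two principal directions, and the projected scores."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n

    # Step 1: center the data on its mean.
    centered = [(x - mean_x, y - mean_y) for x, y in points]

    # Step 2: sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum(x * x for x, _ in centered) / (n - 1)
    syy = sum(y * y for _, y in centered) / (n - 1)
    sxy = sum(x * y for x, y in centered) / (n - 1)

    # Step 3: eigenvalues of a symmetric 2x2 matrix in closed form.
    half_trace = (sxx + syy) / 2
    spread = math.hypot((sxx - syy) / 2, sxy)
    eigenvalues = (half_trace + spread, half_trace - spread)

    # Step 4: the first principal direction is the major axis of the
    # covariance ellipse; the second is perpendicular to it.
    angle = 0.5 * math.atan2(2 * sxy, sxx - syy)
    pc1 = (math.cos(angle), math.sin(angle))
    pc2 = (-math.sin(angle), math.cos(angle))

    # Step 5: project the centered data onto the principal directions.
    scores = [(x * pc1[0] + y * pc1[1], x * pc2[0] + y * pc2[1])
              for x, y in centered]
    return eigenvalues, (pc1, pc2), scores

# Made-up (height in cm, weight in kg) pairs, for illustration only.
athletes = [(170, 65), (175, 72), (180, 78), (185, 85), (190, 88)]
eigenvalues, (pc1, pc2), scores = pca_2d(athletes)
explained = eigenvalues[0] / sum(eigenvalues)
print(f"PC1 direction: ({pc1[0]:.3f}, {pc1[1]:.3f})")
print(f"Share of variance on PC1: {explained:.1%}")
```

When the two variables are strongly correlated, most of the variance falls on the first component, which is why a single score can stand in for both columns. In R, the built-in prcomp() function performs the same decomposition for any number of variables.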

sjPlot 1.3 available #rstats #sjPlot

March 27, 2014
I just submitted my package update (version 1.3) to CRAN. The download is already available (currently source, binaries follow). While the last two updates included new functions for table outputs (see here and here for details on these functions), the current update only provides small helper functions as new functions. The focus of this update

Beautiful table outputs in R, part 2 #rstats #sjPlot

March 4, 2014
First of all, I’d like to thank my readers for all the feedback on my last post on beautiful outputs in R. I tried to consider all suggestions, updated the existing table-output functions and added some new ones, which will be described in this post. The updated package is already available on CRAN. This posting

Genetic data, large matrices and glmnet()

February 25, 2014
While talking to a colleague recently, I came across a problem that I had never worked with before: modeling with genetic... The post Genetic data, large matrices and glmnet() appeared first on Flavio Barros.

Interactive exploration of a prior’s impact

February 21, 2014
Probably the most frequent criticism of Bayesian statistics sounds something like “It’s all subjective – with the ‘right’ prior, you can get any result you want.” To address this criticism, it has been suggested to do a sensitivity analysis (or robustness analysis) that demonstrates how the choice of priors affects the conclusions drawn
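To make the idea of a prior sensitivity analysis concrete, here is a minimal sketch in plain Python. It does not reproduce the model or the interactive tool from the post. It assumes a simple Beta-Binomial setting, in which a Beta(a, b) prior combined with k successes in n trials gives a Beta(a + k, b + n - k) posterior. The function name posterior_mean, the three priors and the data are all hypothetical.

```python
def posterior_mean(prior_a, prior_b, successes, trials):
    """Posterior mean of a success probability under a Beta(prior_a, prior_b)
    prior after observing `successes` out of `trials` (conjugate update)."""
    post_a = prior_a + successes
    post_b = prior_b + (trials - successes)
    return post_a / (post_a + post_b)

# Hypothetical priors expressing different starting beliefs.
priors = {
    "flat Beta(1, 1)": (1, 1),
    "skeptical Beta(2, 8)": (2, 8),
    "optimistic Beta(8, 2)": (8, 2),
}

# Hypothetical data sets with the same 70% success rate but different sizes.
datasets = {"small sample": (7, 10), "large sample": (700, 1000)}

for data_label, (successes, trials) in datasets.items():
    print(data_label)
    for prior_label, (a, b) in priors.items():
        mean = posterior_mean(a, b, successes, trials)
        print(f"  {prior_label}: posterior mean = {mean:.3f}")
```

With the small sample the three priors lead to visibly different posterior means, while with the large sample they nearly coincide. A sensitivity analysis is meant to expose exactly this: how strongly the conclusion depends on the prior for the data at hand.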

Regression with multiple predictors

February 18, 2014
(This article was first published on Digithead's Lab Notebook, and kindly contributed to R-bloggers) Now that I'm ridiculously behind in the Stanford Online Statistical Learning class, I thought it would be fun to try to reproduce the figure on page 36 of the slides from chapter 3 or page 81 of the book. The result is a curvaceous surface...