## Impact of correlated predictions on the variance of an ensemble model

August 21, 2014

Consider the prediction errors of two statistical/machine learning algorithms. Both errors have relatively low bias and high variance. They are also correlated, with some Pearson correlation coefficient. Aggregating the two models might result in a … Continue reading →
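The teaser is cut off, and its original symbols did not survive, but the standard identity behind it can be sketched. The notation here is an assumption: $\epsilon_1$ and $\epsilon_2$ for the two errors, $\sigma_1^2$ and $\sigma_2^2$ for their variances, $\rho$ for their Pearson correlation. Equal-weight averaging is also assumed, and it may not be the aggregation the post goes on to use.

$$
\operatorname{Var}\!\left(\frac{\epsilon_1 + \epsilon_2}{2}\right)
= \frac{1}{4}\left(\sigma_1^2 + \sigma_2^2 + 2\rho\,\sigma_1\sigma_2\right)
$$

In the special case $\sigma_1 = \sigma_2 = \sigma$, this reduces to $\sigma^2 (1 + \rho)/2$. The averaged error then has lower variance than either model alone whenever $\rho < 1$, and the gain shrinks to nothing as $\rho$ approaches 1.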

## Extracting Latent Variables from Rating Scales: Factor Analysis vs. Nonnegative Matrix Factorization

August 21, 2014
For many of us, factor analysis provides a gateway to learning how to run and interpret nonnegative matrix factorization (NMF). This post will analyze a set of ratings on a 218-item adjective checklist using both principal axis factor analysis and NMF....

## CRAN release jsonlite 0.9.10 (RC)

August 19, 2014
The jsonlite package is a JSON parser/generator optimized for the web. It implements a bidirectional mapping between JSON data and the most important R data types. This is very powerful for interacting with web APIs, or to build pipelines where...

## Changes to FSA — Estimating Abundance

August 17, 2014
I mentioned previously that I have been updating the Mark-Recapture vignettes. That has morphed into a document that updates the Mark-Recapture Closed and Open vignettes, the depletion/removal vignettes, and the associated FSA functions. Some of the changes … Continue reading →

August 17, 2014
Navigation gets you from where you are to where you want to be. Speaking of navigation, you can jump to selected sections of this post: Navigation; R-bloggers; Task views; Rdocumentation.org; sos package; ??; apropos; ls; methods; getAnywhere; :::; find; args; grep; %in%; str; getwd; file.choose; Spyglass summary; browser; See also. Overview. Figure 1: A map. The post...

## Easier way to chain commands using Pipe function

August 15, 2014
One of the new features in pipeR 0.4 is the Pipe() function. It creates a Pipe object that allows command chaining with \$, which makes it easier to run operations in a pipeline without any external operator. In this post, I will introduce how to use this function and give some basic background on how it works. But before...

## Reasonable Inheritance of Cluster Identities in Repetitive Clustering

August 15, 2014
… or Inferring Identity from Observations. Let’s assume the following application: a conservation organisation starts a project to geographically catalogue the remaining representatives of an endangered plant species. For that purpose, hikers are encouraged to communicate the location of the plant … Continue reading →

## Propensity Modeling, Causal Inference, and Discovering Drivers of Growth

August 14, 2014
Imagine you just started a job at a new company. You watched World War Z recently, so you're in a skeptical mood, and given that your last two startups failed from what you believe to be a lack of data, you're giving everything an extra critical eye. You start by thinking about the impact of the sales team. How...

## Winston Chang’s “Interactive Graphics with ggvis” at useR! 2014

August 14, 2014
Winston Chang’s “Interactive Graphics with ggvis” talk at useR! 2014 was in many ways a...