The RAppArmor Package: Enforcing Security Policies in R Using Dynamic Sandboxing on Linux

November 14, 2013
An article called The RAppArmor Package: Enforcing Security Policies in R Using Dynamic Sandboxing on Linux has appeared in the latest volume of the Journal of Statistical Software: http://www.jstatsoft.org/v55/i07. The RAppArmor package is one of the foundations of the OpenCPU framework. It protects against malicious use and excessive use of hardware...

A small comparison of bio-equivalence calculations.

November 10, 2013
Last week I looked at two-way cross-over studies and followed the example of Schütz (http://bebac.at/) in the analysis. Since the EU has its own opinions (Questions & Answers: Positions on specific questions addressed to the pharmacokinetics working party) and two example data sets, I was wondering how the various computations compared. Data There...

Key Driver vs. Network Analysis in R

November 8, 2013
When marketing researchers speak of driver analysis, they are referring to an input-output model with overall satisfaction as the output and performance ratings of specific product and service components as the inputs. The causal model is straightforwa...

Failed Randomization In A Randomized Trial?

November 4, 2013
We will continue the saga of the three-arm clinical trial that is giving the editors of the prestigious journal The Spleen a run for their money. While the polls are gathering digital dust, let’s see if we can direct this discussion to a more quantitative path. To do so, we will ask (and answer) the

quantstrat is slow

November 4, 2013
The complaint I hear most frequently about quantstrat is that it's slow, especially for large data.  Some of this slow performance is due to quantstrat treating all strategies as path-dependent by default.  Path dependence requires rules to b...

Introduction to Feature selection for bioinformaticians using R, correlation matrix filters, PCA & backward selection

October 17, 2013
Bioinformatics is becoming more and more a Data Mining field. Every passing day, Genomics and Proteomics yield bucketloads of multivariate data (genes, proteins, DNA, identified peptides, structures), and every one of these biological data units is described by a number of features: length, physicochemical properties, scores, etc. Careful consideration of which features to select when trying...

Le Tour

October 16, 2013
Today I've given the talk on the model for structural zeros and the related R package BCEs0 for the third time in three weeks (this time it was at the London School of Hygiene and Tropical Medicine). Le Tour is going quite well, I think - in...

Fearsome Engines Part 2: Innovations and new features

October 13, 2013
There are lots of R engines emerging! I’ve interviewed members of each of the teams involved in these projects. In part 1 of this series, we covered the motivation of each project. This part looks at the technical achievements and new features. Many of the innovations are performance improvements, reflecting the primary goal of several