## Compcache on Ubuntu on Amazon EC2

May 4, 2010
The following fully-automatic Bash script downloads, compiles, and initializes compcache version 0.6.2 on Ubuntu Karmic Koala (9.10). This script creates two swaps with a maximum of 4GB uncompressed size each. Two swaps are used to take advantage of 2 CPUs (or CPU cores in a multicore CPU). Compcache is a fascinating memory compression system. The

## Beautiful table outputs in R, part 2 #rstats #sjPlot

March 4, 2014
First of all, I’d like to thank my readers for all the feedback on my last post on beautiful outputs in R. I tried to consider all suggestions, updated the existing table-output functions, and added some new ones, which are described in this post. The updated package is already available on CRAN. This posting

## Genetic data, large matrices and glmnet()

February 25, 2014
Recently, while talking to a colleague, I came across a problem that I had never worked with before: modeling with genetic … The post Genetic data, large matrices and glmnet() appeared first on Flavio Barros.

## Interactive exploration of a prior’s impact

February 21, 2014
Probably the most frequent criticism of Bayesian statistics sounds something like “It’s all subjective – with the ‘right’ prior, you can get any result you want.” To address this criticism, it has been suggested to run a sensitivity analysis (or robustness analysis) that demonstrates how the choice of priors affects the conclusions drawn

## Regression with multiple predictors

February 18, 2014
(This article was first published on Digithead's Lab Notebook, and kindly contributed to R-bloggers) Now that I'm ridiculously behind in the Stanford Online Statistical Learning class, I thought it would be fun to try to reproduce the figure on page 36 of the slides from chapter 3 or page 81 of the book. The result is a curvaceous surface...

## ggplot2: Cheatsheet for Visualizing Distributions

February 18, 2014
In the third and last of the ggplot series, this post will go over interesting ways to visualize the distribution of your data.

## Tutorials- Statistical and Multivariate Analysis for Metabolomics

February 17, 2014
I recently had the pleasure of participating in the 2014 WCMC Statistics for Metabolomics Short Course. The course was hosted by the NIH West Coast Metabolomics Center and focused on statistical and multivariate strategies for metabolomic data analysis. A variety of topics were covered using 8 hands-on tutorials which focused on: data quality overview

## Unprincipled Component Analysis

February 10, 2014
As a data scientist I have seen variations of principal component analysis and factor analysis so often blindly misapplied and abused that I have come to think of the technique as unprincipled component analysis. PCA is a good technique often used to reduce sensitivity to overfitting. But this stated design intent leads many to (falsely)

## ShareLaTeX now supports knitr

January 31, 2014
ShareLaTeX is a wonderful and reliable online editor for writing and compiling LaTeX documents “in the cloud” as well as working together in real-time (imagine Google Docs supporting LaTeX => you get ShareLaTeX).

## Computing and visualizing LDA in R