266 search results for "PCA"

Plotting model fits

August 29, 2012
By
Plotting model fits

We all know that it is important to plot your data and explore the data visually to make sure you understand it. The same is true for your model fits. First, you want to make sure that the model is fitting...

Read more »

Why R for Mass Spectrometrist and Computational Proteomics

August 25, 2012
By
Why R for Mass Spectrometrist and Computational Proteomics

Why R:Actually, It is a common practice the integration of the statistical analysis of the resulted data and in silico predictions of the data generated in your manuscript and your daily research. Mass spectrometrist, biologist and bioinformaticians c...

Read more »

2014 Winter Olympics: Home Court Advantage – Russia

August 22, 2012
By
2014 Winter Olympics: Home Court Advantage – Russia

"Russia is a riddle wrapped in a mystery inside an enigma."  -- Winston Churchill, radio address in 1939 A couple of weeks ago, Graph of the Week published an article describing the significant improvement in medals won by the host...

Read more »

ARMA Models for Trading

August 21, 2012
By
ARMA Models for Trading

In this tutorial I am going to share my R&D and trading experience using the well-known from statistics Autoregressive Moving Average Model (ARMA). There is a lot written about these models, however, I strongly recommend Introductory Time Series with R, which I find is a perfect combination between light theoretical background and practical implementations in

Read more »

read raster data in parallel

August 18, 2012
By
read raster data in parallel

Use library(parallel) to read raster data in parallel fashion Use library(parallel) to read raster data in parallel fashion Recently, I have been doing some analysis for a project I am involved in. In particular, I was...

Read more »

Adventures at My First JSM (Joint Statistical Meetings) #JSM2012

August 6, 2012
By
Adventures at My First JSM (Joint Statistical Meetings) #JSM2012

During the past few decades that I have been in graduate school (no, not literally) I have boycotted JSM on the notion that “I am not a statistician.” Ok, I am a renegade statistician, a statistician by training. JSM 2012 was held in San Diego, CA, one of the best places to spend a week during the summer. This...

Read more »

Discriminating Between Iris Species

August 4, 2012
By
Discriminating Between Iris Species

The Iris data set is a famous for its use to compare unsupervised classifiers. The goal is to use information about flower characteristics to accurately classify the 3 species of Iris. We can look at scatter plots of the 4 variables in the data set and see that no single variable nor bivariate combination can achieve this. One approach to improve the separation

Read more »

Archetypal Analysis

July 30, 2012
By
Archetypal Analysis

Thinking Strategically about Customer HeterogeneityIronically, market segmentation, whose motto is "one size does not fit all," seems to rely almost exclusively on one definition of what constitutes a segment.  Borrowing its definition f...

Read more »

Color Palettes in RGB Space

June 20, 2012
By
Color Palettes in RGB Space

Introduction I've recently been interested in how to communicate information using color. I don't know much about the field of Color Theory, but it's an interesting topic to me. The selection of color palettes, in particular, has been a topic I've been faced with lately. I downloaded 18 different sequential color palettes from Cynthia Brewer's

Read more »

Estimation of hydraulic conductivity and its uncertainty from grain-size data using GLUE and artificial neural networks.

June 13, 2012
By
Estimation of hydraulic conductivity and its uncertainty from grain-size data using GLUE and artificial neural networks.

AbstractVarious approaches exist to relate saturated hydraulic conductivity (Ks) to grain-size data. Most methods use a single grain-size parameter and hence omit the information encompassed by the entire grain-size distribution. This study compares two data-driven modelling methods—multiple linear regression and artificial neural networks—that use the entire grain-size distribution data as input for Ks prediction. Besides the predictive capacity of the methods,...

Read more »