# Monthly Archives: March 2013

## Distribution of T-Scores

March 2, 2013
Like most of my posts, these code snippets derive from various other projects. This example shows a simulation of how one can determine whether a set of t statistics is distributed properly. This can be useful when sampling known populations (e.g. U.S. census or hospital populations) or populations that will soon be known
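The excerpt does not include the post's code, so the following is only a minimal sketch of one way such a check could look, not the author's method. It assumes a normal population with a known mean, and the sample size, number of simulations, and seed are arbitrary choices made for illustration. It computes a one-sample t statistic for each simulated sample and compares the results with the theoretical t distribution.

```r
set.seed(42)     # arbitrary seed, for reproducibility only
n_sims <- 2000   # number of simulated samples (assumed)
n      <- 15     # observations per sample (assumed)
mu     <- 50     # known population mean (assumed)

# One-sample t statistic for each sample drawn from N(mu, 10^2)
t_stats <- replicate(n_sims, {
  x <- rnorm(n, mean = mu, sd = 10)
  (mean(x) - mu) / (sd(x) / sqrt(n))
})

# Kolmogorov-Smirnov test against the t distribution with n - 1 degrees of freedom
ks <- ks.test(t_stats, "pt", df = n - 1)
print(ks$p.value)

# Compare a few simulated quantiles with their theoretical counterparts
probs <- c(0.025, 0.5, 0.975)
print(rbind(simulated   = quantile(t_stats, probs),
            theoretical = qt(probs, df = n - 1)))
```

A small p-value or simulated quantiles far from the theoretical ones would suggest the statistics are not t-distributed under the stated assumptions.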

## PCA to PLS modeling analysis strategy for WIDE DATA

March 2, 2013
Working with wide data is already hard enough; add row outliers and things can get murky fast. Here is an example of an analysis of a wide data set, 24 rows x 84 columns, using imDEV, written in R, to calculate and visualize a principal components analysis (PCA) on this data set. We find that

March 2, 2013
About This Blog. My name is Isaac and I'm a Ph.D. student in Clinical Psychology. Why am I writing about fantasy football and data analysis? Because fantasy football involves the intersection of two things I love: sports and statistics. With this blog, I...

## Percentage Winner

March 2, 2013
I know, it has often been said by people much brighter and more competent than I will ever be: the least important figure to look at is the Percentage Winner. Personally I find it very challenging and difficult to follow a system - or even provide see...

## PowerBuilder and R get together

The other day I was thinking about writing a blog using PowerBuilder, but couldn't decide which other technology I should integrate it with...of course...R came to my mind...My journey started around 4 days ago...when I started looking for ways to call R ...

March 2, 2013
A new Armadillo version 3.800.0 is now out. Conrad picked a new numbering scheme to coincide with the relicensing from LGPL to MPL 2.0. The new version 0.3.800.0 of the corresponding RcppArmadillo package (which still uses GPL 2 or later) is now on ...

## Adding Labels to Points in a Scatter Plot in R

What’s the Scatter? A scatter plot displays the values of 2 variables for a set of data, and it is a very useful way to visualize data during exploratory data analysis, especially (though not exclusively) when you are interested in the relationship between a predictor variable and a target variable.  Sometimes, such data come with categorical

## The options mechanism in R

March 2, 2013
Customization in R. Basics: Several features benefit from being customizable, either because of personal taste or specifics of the environment. The way R implements this flexibility is through the `options` function. This both sets and reports options. For example, we can see the names of the options that are set by default: `names(options())`
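As a small illustration of "sets and reports", here is a sketch using only base R. The option name `myblog.verbose` is hypothetical and is used only to show the fallback default of `getOption`.

```r
# Report: a few option names, and the current value of one option
head(names(options()))
getOption("digits")

# Set: options() invisibly returns the previous values, so they can be restored
old <- options(digits = 4)
print(pi)        # prints [1] 3.142
options(old)     # restore the previous setting

# A custom option (hypothetical name) that is not set, with a fallback default
getOption("myblog.verbose", default = FALSE)
```

Saving the return value of `options()` and passing it back is a common way to make a temporary change without leaving the session in a modified state.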

## making a random walk geometrically ergodic

March 1, 2013
While a random walk Metropolis-Hastings algorithm cannot be uniformly ergodic in a general setting (Mengersen and Tweedie, AoS, 1996), because it needs more energy to leave far away starting points, it can be geometrically ergodic depending on the target (and the proposal). In a recent Annals of Statistics paper, Leif Johnson and Charlie Geyer designed

## Tools for making a paper

March 1, 2013
Since it seems to be the fashion, here’s a post about how I make my academic papers. Actually, who am I trying to kid? This is also about how I make slides, letters, memos and “Back in 10 minutes” signs to pin on the door. Nevertheless it’s for making academic papers that I’m going to