255 search results for "PCA"

Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

August 22, 2010
By
Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

Update: fixed projection. There are a bunch of “hockey sticks” that calculate past global temps. through the use of proxies when instrumental data is absent. There is a new one out there by McShane and Wyner (2010) that’s creating quite a stir in the blogosphere (here, here, here, here). The main take out being, that

Read more »

CoRe in CiRM [end]

July 17, 2010
By
CoRe in CiRM [end]

Back home after those two weeks in CiRM for our “research in pair” invitation to work on the new edition of Bayesian Core, I am very grateful for the support we received from CiRM and through it from SMF and CNRS. Being “locked” away in such a remote place brought a considerable increase in concentration

Read more »

Chart the U.S. Gross National Product with the Federal Reserve API

June 17, 2010
By
Chart the U.S. Gross National Product with the Federal Reserve API

The Federal Reserve of St. Louis has an amazing amount of economic data available through their API. You need to apply for an API key, and once you have been approved you include your API key as URL parameter to access your data. api_key='YOUR API KE...

Read more »

Clustergram: visualization and diagnostics for cluster analysis (R code)

June 15, 2010
By
Clustergram: visualization and diagnostics for cluster analysis (R code)

About Clustergrams In 2002, Matthias Schonlau published in “The Stata Journal” an article named “The Clustergram: A graph for visualizing hierarchical and . As explained in the abstract: In hierarchical cluster analysis dendrogram graphs are used to visualize how clusters are formed. I propose an alternative graph named “clustergram” to examine how cluster members are assigned to clusters as...

Read more »

Baseball, basketball, and (not) getting better as time marches on

June 2, 2010
By
Baseball, basketball, and (not) getting better as time marches on

PROS ARE NOT GETTING BETTER AT FREE THROWS Rick Larrick recently told Decision Science News that baseball players have been getting better over the years in a couple ways. First, home runs and strikeouts have increased. The careless or clueless reader might note that this is curious, for from the batter’s perspective home runs are

Read more »

Testing Out my Pitch F/X Data

May 25, 2010
By
Testing Out my Pitch F/X Data

I recently got all the Pitch F/X data downloaded from Gameday, and have been fiddling around. I certainly don't have the physics knowledge to really talk about the movement at this point, and I'm still acquainting myself with the data format and what e...

Read more »

Computational Statistics

May 9, 2010
By
Computational Statistics

Do not resort to Monte Carlo methods unnecessarily. When I received this 2009 Springer-Verlag book, Computational Statistics, by James Gentle a while ago, I briefly took a look at the table of contents and decided to have a better look later… Now that I have gone through the whole book, I can write a short

Read more »

Fun with R: Clustering and MDS

May 5, 2010
By
Fun with R: Clustering and MDS

I've seen K-means clustering, PCA, etc. done some over at Beyond the Boxscore and Baseball Analysts (and the now defunct Statspeak), but I thought I'd just check out some clustering on the young fantasy season using the traditional 5x5 categories with ...

Read more »

Tipping heuristics

April 28, 2010
By
Tipping heuristics

INCREDIBLY SIMPLE CALCULATIONS MADE SIMPLE Yes, we all know how to calculate 15% or 20% exactly, but it’s fun to use tipping heuristics and even more fun to make crowded graphs of how they compare to each other. (Sorry for the junky chart. Open for suggestions, in the words of Tom Waits.) Here are a

Read more »

Estimating Missing Data with aregImpute() {R}

April 19, 2010
By

  Missing Data Soil scientists routinely sample, characterize, and summarize patterns in soil properties in space, with depth, and through time. Invariably, some samples will be lost or sufficient funds required for complete characterization can run out. In these cases the scientist is left with a data table that contains holes (so to speak) in the rows/columns that are...

Read more »