Posts Tagged ‘statistics’

introduction to R: learning by doing (part 1)

July 9, 2012

Geography often relies on statistics, because they are the basis for a fast exchange of information: giving the audience a mean and a standard deviation is often much easier than showing raw data. Learning a scripting language for this purpose can be hard work, but I think it is mostly a matter of practice.

Read more »

The role of Statistics in the Higgs Boson discovery

July 3, 2012

News is starting to leak that the Large Hadron Collider may have accomplished its primary mission of confirming the existence of the hypothesised and heretofore elusive subatomic particle, the Higgs Boson. And sure, billions of Euros worth of state-of-the-art high-energy machinery and an army of experimental and theoretical physicists probably had something to do with the discovery. But did...

Read more »

MatLab, SAS, STATA, SPSS, Excel users: Try R, damn it!

July 2, 2012

Having worked with a multitude of statistical packages over my career, I feel able to evaluate a lot of them. I first used Excel for my calculations, as most ordinary users do. I like the idea behind a spreadsheet and its combination of data and click-to-do functions. Nevertheless, I’ve often...

Read more »

Modeling Trick: Masked Variables

July 1, 2012

A primary problem data scientists face again and again is how to properly adapt or treat variables so that they are the best possible components of a regression. Some analysts at this point delegate control to a shape-choosing system like neural nets. I feel such a choice gives up far too much statistical rigor, transparency and...

Read more »

Trying for a baby? Here’s how long it might take.

June 29, 2012

Wanting to start a family the natural way? For a healthy 45-year-old woman, you may be in for a five-year wait. That's the conclusion of Richie Cotton, a UK-based data scientist, who discovered when he and his girlfriend wanted to start a family that statistics on how long it takes to get pregnant are hard to come by. The...

Read more »

How do I Create the Identity Matrix in R?

June 27, 2012

I googled for this once upon a time and nothing came up. Hopefully this saves someone ten minutes of digging about in the documentation. You make identity matrices with the diag function, giving the matrix size (the number of rows and columns) in parentheses. > diag(3) [,...
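
A minimal sketch of the idea in the excerpt above, using only base R. The variable names `I3`, `A`, and `unchanged` are illustrative assumptions, not taken from the original post; the check at the end is just one way to confirm the result behaves like an identity matrix.

```r
# diag() with a single whole number n returns the n x n identity matrix.
I3 <- diag(3)

# Illustrative check (names are hypothetical): multiplying a conformable
# matrix by the identity should leave it unchanged.
A <- matrix(1:9, nrow = 3)
unchanged <- all(A %*% I3 == A)

print(I3)
print(unchanged)
```

Note that `diag` behaves differently for other inputs: given a vector of length two or more it builds a diagonal matrix from those values, and given a matrix it extracts the diagonal.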

Read more »

Bayesian Nonparametrics in R

June 25, 2012

On July 25th, I’ll be presenting at the Seattle R Meetup about implementing Bayesian nonparametrics in R. If you’re not sure what Bayesian nonparametric methods are, they’re a family of methods that allow you to fit traditional statistical models, such as mixture models or latent factor models, without having to fully specify the number of

Read more »

The Great Julia RNG Refactor

June 21, 2012

Many readers of this blog will know that I’m a big fan of Bayesian methods, in large part because automated inference tools like JAGS allow modelers to focus on the types of structure they want to extract from data rather than worry about the algorithmic details of how they will fit their models to data.

Read more »

useR 2012: impressions, tutorials

June 19, 2012

First of all, useR 2012 (the 8th International R User Conference) was, hands down, the best-organized conference I’ve had the luck to attend. The session chairs kept everything moving on time, tactfully but sternly; the catering was delicious and varied; …

Read more »

Time Series Data Library now on DataMarket

June 19, 2012

The Time Series Data Library is a collection of about 800 time series that I have maintained since about 1992 and hosted on my personal website. It includes data from a lot of time series textbooks, as well as many other series that I’ve either collected for student projects or that helpful people have sent to me. I’ve now moved...

Read more »