January 30, 2013
A Problem: A major problem in secondary data analysis is that you didn't get to decide what data was collected. Let's say you were interested in how many times a student has read the Twilight books. Specifically, you want to know how effective the ads for...

## The "golden age" of a football player

January 28, 2013
It's been some time since my last post on football. And we're talking about European soccer here. So I finally managed to write some functions which allow me to extract player stats from www.transfermarkt.de. The site tracks lots of stats in the world of soccer. For each player, there is information about the dominant foot, height, age, the estimated...

## Formulae in R: ANOVA and other models, mixed and fixed

January 10, 2013
R’s formula interface is sweet but sometimes confusing. ANOVA is seldom sweet and almost always confusing. And random (a.k.a. mixed) versus fixed effects decisions seem to hurt people’s heads too. So, let’s dive into the intersection of these three. I’m aware that there are lots of packages for running ANOVA models that make things nicer

## Data science = failure of imagination

January 8, 2013
From: http://www.r-bloggers.com/data-driven-science-is-a-failure-of-imagination/ I think I like this distinction between Bayesian and Frequentist statistics: "we are nearly always ultimately curious about the Bayesian probability of the hypothesis ...

## Generation of E-Learning Exams in R for Moodle, OLAT, etc.

December 20, 2012
(Guest post by Achim Zeileis) Development of the R package exams for automatic generation of (statistical) exams in R started in 2006, and version 1 was published in JSS by Grün and Zeileis (2009). It was based on standalone Sweave exercises that can be combined …

## Matrix Algebra Useful for Statistics

December 16, 2012
I was having a conversation with an acquaintance about courses that were particularly useful in our work. My forestry degree involved completing 50 compulsory + 10 elective courses; if I had to choose courses that were influential and/or really useful they would be Operations Research, Economic Evaluation of Projects, Ecology, 3 Calculus and 2 Algebras.

## Clarifying a Previous Item Response Theory Link

December 11, 2012
I wanted to thank those of you who have emailed and commented on my last post and its Link to Item Response Theory Presentations Using R. On the Psychometrics Centre website, it is not until Topic 7 called Unidimensional IRT models for binar...

## The Hour of Hell of Every Morning – Commute Analysis, April to October 2012

November 19, 2012
Introduction: So a little while ago I quit my job. Well, actually, that sounds really negative. I'm told that when you are discussing large changes in your life, like finding a new career, relationship, or brand of diet soda, it's important to frame things positively. So let me rephrase that - I've left the job I previously held to pursue other directions. Why?...

## Making sense of random effects

November 16, 2012
The other night in my office I got into a discussion with my office mate, the brilliant scientist / amazing skier Dr. Thor Veen about how to understand the random effect variance term in a mixed-effects model. Thor teaches the R statistics course here at UBC, and last night a student came to the office...