# Monthly Archives: July 2011

## Parallel random forests using foreach

July 22, 2011
By

There's been some discussion on the kaggle forums and on a few blogs about various ways to parallelize random forests, so I thought I'd add my thoughts on the issue.Here's my version of the 'parRF' function, which is based on the elegant version in the...

## Prepping for useR! 2011 – tty connection update

July 22, 2011
By

I'm putting together my presentation for useR! 2011 titled "Experimenting with a tty connection for R". Hence, I've updated the tty connection patch to work with R versions 2.13.0 and 2.13.1. And, instead of re-listing the patch files and re-writing instructions on their application, I've devoted a small portion of my Code page for this

## A Quick Look At Unemployment

July 21, 2011
By

Labor market tightness is defined as the vacancies or job openings rate divided by the unemployment rate.  The theory goes that as job openings increase relative to the unemployment rate a tightness is created in that workers get the upper hand in...

## Smoothing temporally correlated data

July 21, 2011
By
$Smoothing temporally correlated data$

Something I have been doing a lot of work with recently are time series data, to which I have been fitting additive models to describe trends and other features of the data. When modelling temporally dependent data, we often need … Continue reading →

July 21, 2011
By

R reminds me a lot of English. It’s easy to get started, but very difficult to master. So for all those times I’ve spent… well, forever… trying to figure out the “R way” of doing something, I’m glad to share these quick wins. My recent R tutorial on mining Twitter for consumer sentiment wouldn’t have

## Showcasing the latest phylogenetic methods: AUTEUR

July 20, 2011
By

While high-speed fish feeding videos may be the signature of the lab, dig a bit deeper and you’ll find a wealth of comparative phylogenetic methods sneaking in.  It’s a natural union — expert functional morphology is the key to good comparative methods, just as phylogenies hold the key to untangling the evolutionary origins of that

## Regional differences on what drives CO2 emissions

July 20, 2011
By

If you are investigating the change of CO2 emissions, then you might ask: Where do the changes occur? Well here is the answer.The staircase plots show the contributing factors to CO2 emissions for each continent. population refers to population effects, gdp_pcap refers to income per capita, energy_intensity refers to energy used per dollar added value, and carbon intensity...

## Slides for Reproducible Research Talk at Interface 2011

July 20, 2011
By

I gave a talk at the Interface Symposium on reproducible research in practice. I went first in the session, so the slides have a bit more background and philosophy. It was a great session; one of Jon Claerbout's colleagues spoke, Sergey Fomel, a founding author of Madagascar; Sorin Mitran from UNC Chapel Hill talked about

## Visualizing Kickstarter Projects with R

July 20, 2011
By

Kickstarter, a social funding platform where individuals can chip in cash to get a worthy project going, just celebrated their 10,000th kickstarted project. Kickstart employee Fred Benenson recognized the achievement by visualizing the funding of music, design, art, game and many other kinds of projects using R and ggplot2. For example, here's a chart that shows the increasing rate...

## Shorting Mebane Faber

July 19, 2011
By

Although I do not personally know Mebane Faber, I know enough that I do not want to short him. However, I thought it would be insightful to see how the short side of his “A Quantitative Approach To Tactical Asset Allocation” might look.  Once ...