2645 search results for "GIS"

Because it’s Friday: Spurious correlation edition

August 26, 2011

If the Flight of the Conchords taught me anything, it's that you can't trust Australians. This morning I was poking around the DataMarket site when I noticed something suspicious about Australian sheep production: I decided to investigate further: ju...

How to access 100M time series in R in under 60 seconds

August 25, 2011

DataMarket, a portal that provides access to more than 14,000 data sets from various public and private sector organizations, has more than 100 million time series available for download and analysis. (Check out this presentation for more info about DataMarket.) And now with the new package rdatamarket, it's trivially easy to import those time series into R for charting,...

Things I learned at useR!2011

August 25, 2011

The title says “things” but conferences are mainly about people. Some of it can be serendipitous. For example, one day I sat next to Jonathan Rougier at lunch because I had a question for him about climate models. When Jonathan left, I started a conversation with the person on my other side. That was most …

Graphically analyzing variable interactions in R

August 23, 2011

I studied Ecology as an undergraduate, which meant I spent a lot of time gathering and analyzing field data. One of the basic tools we used to look for relationships in a large set of variables was correlation and scatterplot matrices. Each of these ...

Accelerating path-dependent loops: A quick Rcpp case study

August 23, 2011

User BobH asked on StackOverflow about accelerating path-dependent loops. He provided a simple example in which a vector gets filled conditional on the value of the preceding element. Simple to code, but hard to vectorise. By the time I saw that q...

Anonymising data

August 23, 2011

There are only three known jokes about statistics in the whole universe, so to complete the trilogy (see here and here for the other two), listen up: Three statisticians are on a train journey to a conference, and they get chatting to three epidemiologists who are also going to the same place. The epidemiologists are

SIGKDD 2011 Conference — Day 1 (Graph Mining and David Blei/Topic Models)

August 22, 2011

I have been waiting for the KDD conference to come to California, and I was ecstatic to see it held in San Diego this year. AdMeld did an awesome job displaying KDD ads on the sites that I visit, sometimes multiple times per page. That’s good targeting! Mining and Learning on Graphs Workshop 2011: I had originally planned to attend the...

Recession forecasting II: Assessing Hussman’s Accuracy

August 22, 2011

In my last post on recessions, I implemented John Hussman's Recession Warning Composite in R. In this post I will examine how well this index performs and discuss how we might improve it. If you would like to follow along at home, be sure to run the ...

A view of useR!2011

August 22, 2011

Start. Brian Ripley: The conference was opened with a talk by Brian Ripley. I’ll distort his talk into 3 points that came across to me. 1. R Core is finite: The time available from R Core members is a strictly limited good. The more that is pushed onto R Core, the less attention to details. …

useR! Conference 2011 highlights

August 20, 2011

I was at the useR! Conference at The University of Warwick in Coventry, UK, last week. My goal in going was to learn the latest things regarding (simple) dynamic graphics, (simple) web-based apps, parallel computing, and memory management (dealing with big data sets). I got just what I was hoping for and more. There are
