Don’t be a Turkey

November 9, 2010
By
Don’t be a Turkey

'Indeed, I am moving on: my new project is about methods on how to domesticate the unknown, exploit randomness, figure out how to live in a world we don't understand very well. While most human thought (particularly since the enlightenment) has focused us on how to turn knowledge into decisions, my new mission is to...

Read more »

Don’t be a Turkey

November 9, 2010
By
Don’t be a Turkey

'Indeed, I am moving on: my new project is about methods on how to domesticate the unknown, exploit randomness, figure out how to live in a world we don't understand very well. While most human thought (particularly since the enlightenment) has focused us on how to turn knowledge into decisions, my new mission is to...

Read more »

Forecast estimation, evaluation and transformation

November 9, 2010
By
Forecast estimation, evaluation and transformation

I’ve had a few emails lately about forecast evaluation and estimation criteria. Here is one I received today, along with some comments. I have a rather simple question regarding the use of MSE as opposed to MAD and MAPE. If the parameters of a time series model are estimated by minimizing MSE, why do we

Read more »

Particle learning [rejoinder]

November 9, 2010
By
Particle learning [rejoinder]

Following the posting on arXiv of the Statistical Science paper of Carvalho et al., and the publication by the same authors in Bayesian Analysis of Particle Learning for general mixtures I noticed on Hedibert Lopes’ website his rejoinder to the discussion of his Valencia 9 paper has been posted. Since the discussion involved several points

Read more »

Any R packages to solve Vehicle Routing Problem?

November 9, 2010
By
Any R packages to solve Vehicle Routing Problem?

Are there any R packages to solve Vehicle Routing Problem (VRP)?I looked around but could not find any... Any leads?VRP is a classic combinatorial optimization challenge and has been an active area of research for operations research gurus fo...

Read more »

Any R packages to solve Vehicle Routing Problem?

November 9, 2010
By
Any R packages to solve Vehicle Routing Problem?

Are there any R packages to solve Vehicle Routing Problem (VRP)?I looked around but could not find any... Any leads?VRP is a classic combinatorial optimization challenge and has been an active area of research for operations research gurus fo...

Read more »

R co-creator Ross Ihaka wins Lifetime Achievement Award in Open Source

November 9, 2010
By

The co-creator of R, University of Auckland Associate Professor of Statistics Dr. Ross Ihaka, was yesterday awarded the Catalyst Lifetime Achievement in Open Source Award at the 2010 New Zealand Open Source Awards. From the announcement: Dr. Ihaka is one of the originators of the world-renown ‘R’ programming language and software environment for statistical computing and graphics. In 2008...

Read more »

Promote your favorite R functions

November 9, 2010
By

The 27 base and recommended libraries of the standard R 2.12 distribution together contain 3556 functions (you can check using the code posted after the jump). Many of the functions are commonly used: c, data.frame, rnorm, lm. But some of those functions, while being extremely useful, may be less well known to many R users. Some examples I'd wish...

Read more »

New R User Group in Houston

November 9, 2010
By

The latest local R user group to form is located in Houston, Texas. The first meeting of the Houston R Users Group is tonight at Rice University (in conjunction with the Houston chapter of the ASA). R hackr (typo intended!) Hadley Wickham will be giving a presentation on writing R packages, and you can check out the slides on...

Read more »

Mapping drug war related homicides in 2010

November 9, 2010
By
Mapping drug war related homicides in 2010

There have been some very good visualizations of the Wikileaks data so I decided to create one of the drug war in MexicoThe above map was made using data collected by Walter McKay, mainly from El Universal and El Diario reports. The data is stored as a Google Map...

Read more »

Mapping drug war related homicides in 2010

November 9, 2010
By
Mapping drug war related homicides in 2010

There have been some very good visualizations of the Wikileaks data so I decided to create one of the drug war in MexicoThe above map was made using data collected by Walter McKay, mainly from El Universal and El Diario reports. The data is stored as a Google Map...

Read more »

The ARORA guessing game

November 9, 2010
By
The ARORA guessing game

The game ARORA (A random or real array) is a website that gives you two time series at a time. Your job is to guess which series is real market data and which is permuted data.  It’s fun — try it. With some practice you will probably be able to guess which is which well … Continue reading...

Read more »

Computational position in Texas

November 8, 2010
By
Computational position in Texas

José Bernardo forwaded this announcement that sounds quite attractive (conditional upon living in a remote part of Texas!) Senior Faculty Position in Computational Statistics At Texas A&M University As part of a recognition of the increasing importance in the modeling and computational sciences, the Department of Statistics at Texas A&M University is recruiting for a

Read more »

Using R and Hadoop to analyze VOIP data

November 8, 2010
By

Last month, the newest member of Revolution's engineering team, Saptarshi Guha, gave a presentation at Hadoop World 2010 on using R and Hadoop to analyze 1.3 billion voice-over-IP packets to identify calls and measure call quality. Saptarshi, of course, is the author of RHIPE, which lets R programmers write map-reduce algorithms in the Hadoop framework without needing to learn...

Read more »

The Dataists answer your questions

November 8, 2010
By

The fine bloggers (and R experts) at the Dataists have volunteered to answer questions about data analysis on Reddit: A few months ago, a group of likeminded folks in New York and the San Francisco Bay area decided it was time to start a blog about data, and we can up with the Dataists. Since then we thought about...

Read more »

Example 8.13: Bike ride plot, part 2

November 8, 2010
By
Example 8.13: Bike ride plot, part 2

Before explaining how to make and interpret the plot above, Nick and I want to make a plea for questions--it's hard to come up with useful questions to explore each week!As shown in Example 8.12, data from the Cyclemeter app can be used to make interes...

Read more »

The NYC Marathon

November 8, 2010
By
The NYC Marathon

New York’s annual marathon took place yesterday. Watching a bit of it on television with my friends, I was struck by the much earlier starting time for women than men. Specifically, professional women started running yesterday at 9:10 AM, while professional men start running at 9:40 AM. (This information comes from the runner’s handbook.) I

Read more »

R Beginner’s Guide Book Update: Statistical Analysis with R Released

November 8, 2010
By
R Beginner’s Guide Book Update: Statistical Analysis with R Released

In the final days of October, my beginner's guide to R was released. The book's official title is Statistical Analysis with R and it can be found on the Packt Publishing website. The primary focus of Statistical Analysis with R is helping new users bec...

Read more »

R Beginner’s Guide Book Update: Statistical Analysis with R Released

November 8, 2010
By
R Beginner’s Guide Book Update: Statistical Analysis with R Released

In the final days of October, my beginner's guide to R was released. The book's official title is Statistical Analysis with R and it can be found on the Packt Publishing website. The primary focus of Statistical Analysis with R is helping new users bec...

Read more »

A R wrapper for Google Prediction API

November 8, 2010
By
A R wrapper for Google Prediction API

Since I got the chance to access to both Google Storage for Developers and Google Prediction API (more details here and here), I decided to create a simple wrapper (just 4 basic functions until now) to be capable to play with the Google Prediction API ...

Read more »

A R wrapper for Google Prediction API

November 8, 2010
By
A R wrapper for Google Prediction API

Since I got the chance to access to both Google Storage for Developers and Google Prediction API (more details here and here), I decided to create a simple wrapper (just 4 basic functions until now) to be capable to play with the Google Prediction API ...

Read more »

Le Monde puzzle [43]

November 7, 2010
By
Le Monde puzzle [43]

Here is the puzzle in Le Monde I missed last week: Given a country with 6 airports and a local company with three destinations from each of the six airports, is it possible to find a circular trip with three intermediate stops from one of the airports? From all of the airports? One more airport

Read more »

Wetbulb Temperature

November 7, 2010
By
Wetbulb Temperature

This google map display is just one of 230 GHCN stations that is located in the water. After finding  instances of this phenomena over and over, it seemed an easy thing to find and analyze all such cases in GHCN. The issue matters for a two reasons: In my temperature analysis program I use a

Read more »

Updating meteorological forecasts, part 1

November 7, 2010
By
Updating meteorological forecasts, part 1

As Mark Twain said "the art of prophecy is very difficult, especially about the future" (well, actually I am not sure Mark Twain was the  first one to say so, but if you're interested by that sentence, you can look here). I have been rather su...

Read more »

R is a cool image editor!

November 7, 2010
By
R is a cool image editor!

Here I present some functions I wrote to recreate some of the most common image effect available in all image editor.They require the library rimage.To load the image, use:y <- read.jpeg("path")To display the image, use:plot(y)Original imageSepia tonergb2sepia <- function(img){ iRed <- img*255 iGreen <- img*255 iBlue <- img*255  oRed <- iRed * .393...

Read more »

R is a cool image editor!

November 7, 2010
By
R is a cool image editor!

Here I present some functions I wrote to recreate some of the most common image effect available in all image editor.They require the library rimage.To load the image, use:y <- read.jpeg("path") To display the image, use:plot(y)Original imageSepia tone rgb2sepia <- function(img){ iRed <- img*255 iGreen <- img*255 iBlue <- img*255  oRed <- iRed * .393...

Read more »

Installing R packages

November 6, 2010
By
Installing R packages

Part of the reason R has become so popular is the vast array of packages available at the cran and bioconductor repositories. In the last few years, the number of packages has grown exponentially! This is a short post giving steps on how to actually install R packages. Let’s suppose you want to install the

Read more »

Livin’ la Vida Poisson

November 5, 2010
By
Livin’ la Vida Poisson

Yes, I did just mix English, Spanish and French. And no, I living the “fishy” life, popular opinion to the contrary. Here’s the story. As someone who spends the majority of his time working online, with no oversight, I notice that I tend to drift a lot. I don’t play solitaire, or farm for virtual

Read more »

The bms Function Explained

November 5, 2010
By

This text 'pedagocially' explains how the bms function works code-wise and is intended for people who prefer to program customized adjustments of the bms package. bms is the workhorse function to do the sampling part of Bayesian Model Averaging in the...

Read more »