For a wider use of R

October 22, 2010

Two things are crucial for a wider use of R among applied researchers. The first is a data manipulation/reshaping tool. I think the packages "reshape" and "reshape2" have done a good job and have largely removed that barrier. The second is ...

Read more »

How to avoid annoying a referee

October 22, 2010

It’s not a good idea to annoy the referees of your paper. They make recommendations to the editor about your work and it is best to keep them happy. There is an interesting discussion on stats.stackexchange.com on this subject. This inspired my own list below. Explain what you’ve done clearly, avoiding unnecessary jargon. Don’t claim

Read more »

abc

October 21, 2010

Michael Blum and Olivier François, along with Katalin Csillery, just released an R package entitled abc. (I am surprised the name was not already registered!) Its aim is obviously to implement ABC approximations for Bayesian inference. Description: The ’abc’ package provides various functions for parameter estimation and model selection in an ABC framework. Three main

Read more »

Promising R Packages

October 21, 2010

As a quick note, here are two R packages that were mentioned to me recently and that look promising: reldist and mixtools.

Read more »

vecLib: Why Mac users are better off with Open Source R

October 21, 2010

Thanks to its use of Apple's vecLib for BLAS, LAPACK and related functions, the open source distribution of R performs many matrix calculations 2-10X faster than Revolution Analytics Community edition on Mac OS X systems.

Read more »

R is Hot: Part 3

October 21, 2010

This is Part 3 of a five-part article series, with new parts published each Thursday. You can download the complete article from the Revolution Analytics website. Power from Elegance If the R movement has a genuine rock star, it’s probably Hadley Wickham. He’s an assistant professor and the Dobelman Family Junior Chair in Statistics at Rice University. He’s written...

Read more »

Chicago R Meetup slides

October 21, 2010

The second stop of my world tour was Chicago last night, where I presented a quick, light review of various ways to represent objects in R: lexical scoping, S3, S4, the new reference classes, and also C++ using Rcpp modules or RProtoBuf. My sli...

Read more »

Ricky Ponting and Sachin Tendulkar

October 21, 2010

There is a lot of hype in the media about Ponting's best days being behind him. While he's rebuilding a fresh Aussie team, everyone is putting more pressure on him by comparing his performance with Tendulkar's. Leave him alone! We have the privilege of watc...

Read more »

What’s that 5km from the station “location”

October 21, 2010

In our last installment we looked at stations which were pitch black. The case I examined, Middlesboro, Kentucky, illustrated two things: (1) the station location data used by Hansen2010 has inaccuracies; (2) while the purported station location was pitch dark, nearby, within a couple of hundredths of a degree, there were urban lights. What this example illustrated was

Read more »

Finding presence data for species distribution modelling (SDM)

October 20, 2010

Getting presence data for a species is often not easy and can be a major obstacle when attempting to model its distribution. One option is the GBIF database. Here I show one way to obtain presence …

Read more »

Installing rJava on Ubuntu

October 20, 2010

I had some trouble installing rJava on Ubuntu. The easiest solution I found was installing from the command line using: … Then it worked without any problems.

Read more »

The "tikzDevice" package

October 20, 2010

The tikzDevice package is quite amazing. Here are two graphs I just made, with (lower) and without (upper) using the tikzDevice package. The difference in quality is huge. Since the LaTeX source file for the figure is quite large and may take si...

Read more »

The 2009 homicide data for Chihuahua has been updated

October 20, 2010

As I described in this post, the reported number of homicides in Chihuahua during 2009 turned out to be incomplete. Guess what? The SNSP just released an updated version of the homicide data, and Chihuahua went from having 2,523 homicides to 3,156. My estimate was 3,256, so I came pretty...

Read more »

R Links for the Beginner on World Statistics Day

October 20, 2010

In honor of the first World Statistics Day I thought I would share some of my favorite R links. R is a free software statistical computing environment for performing all sorts of data and mathematical manipulation. Introduction and Tutorials: R Tuto...

Read more »

Hold on to your hats: it’s World Statistics Day!

October 20, 2010

Apparently today is the first ever World Statistics Day. I only knew about it because I'd seen a couple of passing references to it from the stats folks I follow on Twitter. But I guess this UN-sponsored event is a big deal, judging from the official website: The celebration of the World Statistics Day will acknowledge the service provided...

Read more »

Trading secrets

October 20, 2010

Recently I had the opportunity to do a job swap with one of the guys in the laboratory here at HSL.  I helped out with the mass-spectrometry and James helped me with the data analysis.  Two very useful things came out of this. Firstly, it’s been very informative to see how the data I get

Read more »

Programming with R – Function Basics

October 20, 2010

One of the benefits of using R for statistical analysis is its programming language, which allows users to define their own functions. This is particularly useful for analyses that need to be repeated. For example, a monthly output from a database may be provided in a pre-determined format and we might be interested in running

Read more »

Central Limit Theorem A nice illustration of the Central Limit…

October 20, 2010

A nice illustration of the Central Limit Theorem by convolution, in R...

Read more »

upgrade R – F77 cause compilation error

October 20, 2010

I tried to compile the source code of R 2.12 on CentOS, but it threw an error when trying to install *cluster*: * installing *source* package ‘cluster’ ...

Read more »

Transactions, and Pondering their Use in Casinos

October 20, 2010

A couple of weeks ago, Bradford Cross of FlightCaster posted in Measuring Measures that transactions are the next big data category. I argue that they already are. From reading his blog post, he seems to suggest this as well, though I will admit that I think I missed his point. There are some clear examples of transactions and...

Read more »

Monitoring Productivity Experiment

October 20, 2010

For over a year now, I've been recording how much time I spend at the computer and how much of it is actually used for creative/productive activities. By productive activity I mean the time spent in a text editor (Emacs), terminal, Excel or a datab...

Read more »

Coincidence in lotteries

October 19, 2010

Last weekend, my friend and coauthor Jean-Michel Marin was interviewed (as Jean-Claude Marin, sic!) by a national radio station about the probability of the replication of a draw on the Israeli Lottery. Twice the same series of numbers appeared within a month. This lottery operates on a principle of 6/37 + 1/8: 6 numbers are drawn

Read more »

Happy World Statistics Day!

October 19, 2010

The United Nations has declared today “World Statistics Day”. I’ve no idea what that means, or why we need a WSD. Perhaps it is because the date is 20.10.2010 (except in North America where it is 10.20.2010). But then, what happens from 2013 to 2099? And do we just forget the whole idea after 3112?

Read more »

Example 8.10: Combination dotplot/boxplot (teaching graphic in honor of World Statistics Day)

October 19, 2010

In honor of World Statistics Day and the read paper that my co-authors Chris Wild, Maxine Pfannkuch, Matt Regan, and I are presenting at the Royal Statistical Society today, we present the R code to generate a combination dotplot/boxplot that is useful...

Read more »