RClimate Script: Sea Surface Temperature (SST) Anomaly Trends

February 1, 2010
By

This RClimate Script lets users retrieve and plot the National Climatic Data Center’s monthly Sea Surface Temperature (SST) Anomaly data series. Links to the NCDC data file and my RClimate script are included. Users can run my script with a simple R source() statement. NCDC SST Anomaly: I’ve discussed the Hadley SST anomaly trends

Read more »

R Tutorial Series: Regression With Categorical Variables

February 1, 2010
By

Categorical predictors can be incorporated into regression analysis, provided that they are properly prepared and interpreted. This tutorial will explore how categorical variables can be handled in R. Tutorial Files: Before we begin, you may want to download the sample data (.csv) used in this tutorial. Be sure to right-click and save the file to your R working directory....

Read more »
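
The tutorial above is written for R, and its code is not part of this excerpt. As a rough illustration of what "properly prepared" can mean, here is a minimal Python sketch of dummy (treatment) coding, in which a categorical predictor with k levels becomes k - 1 indicator columns measured against a reference level. The function name dummy_code and the sample values are hypothetical and are not taken from the tutorial's data file.

```python
def dummy_code(values, reference=None):
    """Dummy-code a categorical variable (illustrative helper, not from the tutorial).

    Returns the names of the indicator columns and one row of 0/1
    indicators per observation. The reference level gets no column.
    """
    levels = sorted(set(values))
    if reference is None:
        reference = levels[0]  # assumption: first level in sort order is the baseline
    columns = [level for level in levels if level != reference]
    rows = [[1 if value == level else 0 for level in columns] for value in values]
    return columns, rows


# Hypothetical three-level predictor with "low" as the reference level.
columns, rows = dummy_code(["low", "high", "medium", "low"], reference="low")
print(columns)  # ['high', 'medium']
print(rows)     # [[0, 0], [1, 0], [0, 1], [0, 0]]
```

Under this coding, the regression coefficient on each indicator column is read as the difference from the reference level, which is where the interpretation step mentioned in the excerpt comes in.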

Some Python Nooks and Crannies

January 31, 2010
By

I spent this weekend reading Learning Python (Second Edition for Python 2.3!) by Mark Lutz. Python is my favorite programming language, but my experience with it has been mostly anecdotal; I come up with my own solutions and functions and I Google whatever I do not know. I decided to spend a couple of days with this incredibly out-of-date...

Read more »

Rcpp 0.7.4

January 31, 2010
By

Yesterday, and about nine days after release 0.7.3 of Rcpp (a set of R / C++ interface classes), Romain and I released version 0.7.4. It has been uploaded to CRAN and Debian, and mirrors should already have new versions. As before, my local page is als...

Read more »

With With

January 31, 2010
By

No, that is not a typo in the title. In my programming I came across a solution that I thought was pretty cool. I have a function that basically takes two objects and passes the elements of the objects to another function as arguments. This is a pret...

Read more »

Congruential generators all are RANDUs!

January 30, 2010
By

In case you did not read all the slides of Régis Lebrun’s talk on pseudo-random generators I posted yesterday, one result of Marsaglia’s (in a 1968 PNAS paper) exhibited my ignorance during Régis’ Big’MC seminar on Thursday. Marsaglia indeed showed that, for all multiplicative congruential generators, tuples of successive outputs lie on a series of hyperplanes whose number gets ridiculously

Read more »
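
For the RANDU generator named in the title above, the hyperplane structure can be checked directly. RANDU is the multiplicative congruential generator x_{k+1} = 65539 * x_k mod 2^31, and because 65539 = 2^16 + 3, every three successive values satisfy x_{k+2} - 6 * x_{k+1} + 9 * x_k = 0 (mod 2^31). The Python sketch below only verifies that identity. It is not taken from the post or the slides, and the function names are illustrative.

```python
MODULUS = 2**31
MULTIPLIER = 65539  # RANDU's multiplier, equal to 2**16 + 3


def randu(seed, n):
    """Return n successive RANDU values: x_{k+1} = 65539 * x_k mod 2**31."""
    values = []
    x = seed
    for _ in range(n):
        x = (MULTIPLIER * x) % MODULUS
        values.append(x)
    return values


def satisfies_plane_relation(values):
    """Check x_{k+2} - 6*x_{k+1} + 9*x_k == 0 (mod 2**31) for every triple."""
    return all(
        (values[k + 2] - 6 * values[k + 1] + 9 * values[k]) % MODULUS == 0
        for k in range(len(values) - 2)
    )


xs = randu(seed=1, n=1000)
print(xs[:2])                        # [65539, 393225]
print(satisfies_plane_relation(xs))  # True
```

Since the relation holds exactly modulo 2^31, the scaled triples (x_k, x_{k+1}, x_{k+2}) / 2^31 can only fall on a small family of parallel planes in the unit cube rather than filling it.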

Practical Implementation of Neural Network based time series (stock) prediction – PART 2

January 30, 2010
By

As a brief follow-up to the series, I want to take a moment to describe a bit about Weka, which is the machine learning tool that we will be using to implement the neural network. It is a fantastic open source Java-based tool that was developed at the...

Read more »

Mining Tuition Data for US Colleges and Universities, and a Tangent

January 30, 2010
By

I wrote this script for the UCLA Statistical Consulting Center. I don’t know all of the specifics, but one of our faculty members has this idea that we can help our paper, The Daily Bruin, with their graphics or something to that effect. I don’t quite understand because our paper has never really been big on graphics for data,...

Read more »

Practical Implementation of Neural Network based time series (stock) prediction – PART 1

January 29, 2010
By

The following introduction is meant to help readers understand the basic concepts and practical implementation of neural nets applied to a financial time series. I will not go too deep into detail about the mathematics behind the neural net at the moment. ...

Read more »

Big’MC seminar

January 29, 2010
By

Two very interesting talks at the Big’MC seminar on Thursday: “Phylogenetic models and MCMC methods for the reconstruction of language history” by Robin Ryder, and “Uniform and non-uniform random generators” by Régis Lebrun. Both are on topics close to my interests: evolution of languages (I’ll be a philologist in another life!) and uniform random generators.

Read more »

R creators win prestigious Statistical Computing and Graphics Award

January 29, 2010
By

The American Statistical Association recently created a new, bi-annual award to recognize an individual or team for innovation in computing, software, or graphics that has had a great impact on statistical practice or research. The committee has just announced the winner (or in this case, joint winners) of the first award: Robert Gentleman and Ross Ihaka, for their work...

Read more »

Crayola crayon colors, 1949-present

January 29, 2010
By

Here's an example I featured in my list of 7 Awesome Things about R (awesome thing #3: graphics and data visualization). The Learning R blog features a reproduction of a graphic that recently appeared on Flowing Data. It shows the colors in a box of Crayola crayons: before 1949 there were only 8, but over the years additional colors...

Read more »

Looking for a Bayésien PhD

January 28, 2010
By

I just got this email (yes, in French; translated here) looking for a Bayesian ready to work on algorithms: On behalf of the company Vekia, we are looking for a PhD in Bayesian statistics for a position in Lille, to be filled as soon as possible. Vekia is a retail software publisher founded in 2007 by two researchers (Pierre-Arnaud

Read more »

RClimate Script: NINO 3.4 SST Anomaly Trends

January 28, 2010
By

This RClimate Script lets users retrieve and plot the weekly NOAA NINO 3.4 SST anomaly data for 1990 to the most recent value. Links to the NOAA data file and my RClimate script are included. Users can run my script with a simple R source() statement. NINO 3.4 SST Anomaly: I’ve discussed ENSO and NINO

Read more »

Introduction to R webinar today, slides available

January 28, 2010
By

Just a quick reminder that I'll be hosting an introductory webinar about R today, The R Project: Data Analysis and Statistical Graphics for the Enterprise. It's at 9AM Pacific, so you might still have time to register for the live session at the link below. Otherwise, if you did catch the live session, you can pick up the slides...

Read more »

Advanced Graphics in R

January 27, 2010
By

Each quarter the UCLA Statistical Consulting Center hosts minicourses twice per week in R and LaTeX. Tonight was my turn to present. I presented Advanced Graphics in R. This was the same presentation I gave at the LA R Users’ Group in August with a fellow consultant. She and I had trouble coming together to make one presentation, so we...

Read more »

From the “blogosphere”? Hardly.

January 27, 2010
By

I generally skip over “From the Blogosphere”, a (mostly) weekly summary of one or two blog posts in Nature’s “Authors” section (here is the latest). Why? Well, I’ve always suspected that the title is rather misleading. Now, I have the hard numbers to prove it. My feed reader contains an archive of 128 articles, dating back

Read more »

Re-mapping Massachusetts Special election results

January 27, 2010
By

I had previously posted maps showing the difference in major party vote share between the 2008 Presidential election and the 2010 special Senate election in Massachusetts. Colleagues and readers of the Revolutions blog had some very insightful criticisms of these maps, in particular that the color scale was over-stating the swing in voter sentiment. I’ve

Read more »

RClimate Script: Polar Amplification – 2000 to 2009

January 27, 2010
By

This RClimate Script lets users retrieve and plot the NASA GISS temperature anomaly data for 2000 to 2009 by latitude zone during the past decade. A link is provided to the NASA GISS data generation query page as well as links to my saved file of the GISS data and my RClimate script that users

Read more »

How to combine Google maps and data in R

January 27, 2010
By

Every good artist needs a canvas, and when it comes to displaying geographic data, placing those data in context -- on a map -- makes all the difference. A new package for R from Markus Loecher, RgoogleMaps, allows you to download a street or satellite map from Google simply by specifying the bounding latitude/longitude coordinates. (You need to sign...

Read more »

Bayesian courses in København

January 26, 2010
By

I received this announcement about two upcoming courses given in København by Andrew Lawson: 1) “An Introduction to Bayesian Disease Mapping”, a two-day course, April 12-13, 2010, University of Southern Denmark. This course is designed to provide an introduction to the area of Bayesian disease mapping in applications to Public Health and Epidemiology. 2) “Advanced Bayesian Disease Mapping” A

Read more »

What programmers should know about Statistics

January 26, 2010
By

Reader KW pointed me to this rant essay from Ruby on Rails enfant terrible Zed Shaw on what computer programmers don't know about statistical analysis, but should. (Spoiler alert: a lot, apparently.) Perhaps surprisingly, building complex software systems often involves a lot of simulation, experimentation, and measurement for which statistical methods would be an asset. But according to Shaw,...

Read more »

What Countries are ‘Pulling their Weight’ for Haiti?

January 26, 2010
By

Using the data provided by ReliefWeb on the Appeals and Funding to Haiti (h/t DataBlog) and the most recent GNP estimates, I decided to do a little “back of the envelope” analysis. With GNP as a proxy for a country’s wealth, the hypothesis is that pledges should roughly be a linear function of wealth, i.e., the

Read more »

Free GIS Resources

January 26, 2010
By

Over the last couple of days I have utilised some excellent free GIS resources. I have listed these and some others below. Geospatial Analysis: This is the free online version of de Smith, Longley and Goodchild’s excellent book by the same title. It provides full coverage of current GIS methodologies. It also provides extensive information

Read more »

ggplot2: Quick Heatmap Plotting

January 25, 2010
By

A post on the FlowingData blog demonstrated how to quickly make a heatmap using R base graphics. This post shows how to achieve a very similar result using ggplot2. Data Import: FlowingData used last season’s NBA basketball statistics provided by databasebasketball.com, and the csv-file with the data can be downloaded directly from its website.

Read more »

Mapping the Massachusetts election upset with R

January 25, 2010
By

The blog Offensive Politics has done some in-depth analysis of the recent Senate special-election upset in Massachusetts, comparing the results of victorious Republican candidate Scott Brown to those of the unsuccessful Republican Presidential candidate John McCain in 2008. It's pretty clear that Brown out-performed expectations with Democratic voters, but this chart of the change in Democratic voters from 2008...

Read more »

Robert Gentleman joins REvolution’s board of directors

January 25, 2010
By

We're so excited here at REvolution Computing to announce that Robert Gentleman has joined our board of directors. Robert is one of the two originators of the R Project: a research project between Robert and Ross Ihaka in 1996 was the genesis of the R language. (Both Robert and Ross were profiled in an article in the New York...

Read more »