Heatmap tables

January 7, 2011

I blogged earlier (http://socialdatablog.com/what-is-wrong-with-this-graph) about the well-known risks of implying a continuous data scale in a graph where there isn't one. I just produced this alternative in the form of a heatmap table, i.e. a heatmap in which the numbers themselves are also shown. Perhaps not quite as intuitive but less misleading. It uses the

Read more »

survival curves for Leonid

January 7, 2011

Leonid asked me to do a quick survival analysis of two different types of mouse (m430 and m210) with surgically implanted tumours (or something like that). The data was in the wrong format but after transforming it looked like this: In my opini...

Read more »

Boris Bikes/Barclays Cycle Hire Average Journey Times

January 6, 2011

The visualisation above shows the average relative duration of Boris Bikers’ weekday journeys over a 4-month period at hourly intervals. For each time step the average journey time (in seconds) from each docking station has been calculated. This information is interesting because it shows the preference for short journeys around the City of London, whilst ...

Read more »

a survey on ABC

January 6, 2011

With Jean-Michel Marin, Pierre Pudlo and Robin Ryder, we just completed a survey on the ABC methodology. It is now both arXived and submitted to Statistics and Computing. Rather interestingly, our first draft was written in Jean-Michel’s office in Montpellier by collating the ‘Og posts surveying new ABC papers! (Interestingly because this means that my

Read more »

formatR update (0.1-6)

January 6, 2011

A new version of the formatR package is available on CRAN now (binary packages are still on the way). There are three major updates: inline comments are now also preserved in most cases (in earlier versions, only single lines of comments were preserved); tidy.source() gained a new argument 'text' to accept a character vector

Read more »

web content analyzer

January 6, 2011

Just developed a small crawler to check my online content at binfalse.de in terms of W3C validity and the availability of external links. Here is the code and some statistics...

Read more »

Gapminder

January 6, 2011

As many people are aware, Hans Rosling is an enthusiastic Swedish academic with a passion for statistics who recently presented the program The Joy of Stats. One of the great things about Hans Rosling is his presentations and the interactive graphics that he uses to make his points. The gapminder software

Read more »

New R User Group in Kansas City

January 6, 2011

There's a new R User Group based in Kansas City, Kansas. Abraham Mathew just launched the group's website, and is looking for R users in the area to kick things off: This group was started to bring together R users in the Kansas City area to exchange knowledge and provide guidance to new R users. We hope to have...

Read more »

Some market predictions

January 6, 2011

We look at a few forecasts for the year 2011 that we’ve run across, and compare them with the prediction distributions presented in Revised market prediction distributions. FTSE 100: There is a “range forecast” on an Interactive Investor page of 5350 to 6565. It isn’t clear (to me at least) what this means, but I …

Read more »

RClimate: Converting 5 Global Temperature Anomaly Series to A Common Baseline

January 6, 2011

The 5 global land-ocean temperature anomaly (LOTA) series use different baseline periods, making direct comparisons between the series more difficult than it would be if each series had the same baseline period. This post shows how to convert the 5 ...

Read more »
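
The re-baselining idea in the excerpt above can be sketched in R as follows. This is a minimal illustration on made-up numbers, not the post's own code: the helper name `rebaseline`, the toy series, and the 1981-2010 baseline window are all assumptions chosen for the example.

```r
# Shift an anomaly series so that its mean over a chosen baseline period is zero.
# `rebaseline` is a hypothetical helper, not a function from the original post.
rebaseline <- function(year, anomaly, base_start, base_end) {
  in_base <- year >= base_start & year <= base_end
  anomaly - mean(anomaly[in_base], na.rm = TRUE)
}

# Two toy series (made-up values) that differ only by a constant baseline offset
year <- 1971:2010
signal <- 0.02 * (year - 1971)   # synthetic trend, degrees C
series_a <- signal - 0.10        # pretend baseline A
series_b <- signal - 0.35        # pretend baseline B

# Put both series on the same (assumed) 1981-2010 baseline
a_common <- rebaseline(year, series_a, 1981, 2010)
b_common <- rebaseline(year, series_b, 1981, 2010)

# After conversion the two series should coincide
all.equal(a_common, b_common)
```

Subtracting the baseline-period mean only shifts each series by a constant, so trends are unchanged and the series become directly comparable.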

sab-R-metrics: Basics of Vectors and Data Calling

January 6, 2011

Wednesday, I began a new series called "sab-R-metrics". My hope is that it reduces the frustration that goes along with learning a new programming language and enhances others' ability to perform their own analysis in baseball or other sports. However, these tutorials will hopefully allow you to use these skills in other areas as well. ...

Read more »

Ecological networks from abundance distributions

January 6, 2011

Another grad student and I tried recently to make a contribution to our understanding of the relationship between ecological network structure (e.g., nestedness) and community structure (e.g., evenness)... Alas, I had no luck gaining new insights. How...

Read more »

Graph gallery in R

January 6, 2011

R is sometimes criticized for producing graphs that are not as elaborate as those of Matlab or other software. Here is a link to a graph gallery by Romain François to “enhance your data visualization with R”. The corresponding R code is given. Might be useful for ENSAE students for ‘statap’ projects. Below are four examples. The maps

Read more »

Learning R — Documentation

January 6, 2011

We use R a lot. R takes care of many of our basic data management needs. R is an awesome statistical analysis package. R allows you to produce exceptional data graphics. The only problem is … R has a wicked learning …

Read more »

Short review of the R book

January 5, 2011

David Scott wrote a review of Introducing Monte Carlo Methods with R in the International Statistical Review that is rather negative, since the main bulk reads as follows: I found some aspects of the book very disappointing. The first chapter (“Basic R Programming”) has some unfortunate mistakes and some statements, which are contentious at least

Read more »

My first R package: zipcode

January 5, 2011

My first package, zipcode, is now available on CRAN. It contains the CivicSpace database of 43,191 U.S. zip codes.

Read more »

sab-R-metrics: Introduction to R

January 5, 2011

In a recent post, I briefly mentioned that I may turn a majority of the focus of this blog to teaching R commands for use with sabermetric analysis. Only a few days later, Ricky Zanker began a new column at The Hardball Times doing just that. But that's okay. Hopefully both his and mine...

Read more »

R-bloggers

January 5, 2011

Just a quick FYI note in case you haven't seen this site. R-bloggers is an awesome site, bringing together more than 140 blogs (including mine) about R in a single location. See Tal Galili's motivation for creating the site, and his notes on the site here.

Read more »

New approach to analysis of phylogenetic community structure

January 5, 2011

Anthony Ives, of the University of Wisconsin-Madison, and Matthew Helmus, of the Xishuangbanna Tropical Botanical Garden, present a new statistical method for analyzing phylogenetic community structure in an early view paper in Ecological Monographs. See th...

Read more »

Adap’skiii [day 2]

January 5, 2011

Another exciting day at Adap’skiii!!! Yves Atchadé presented a very recent work on the fundamental issue of estimating the asymptotic variance for adaptive MCMC algorithms, with an intriguing experimental observation that a non-converging bandwidth with rate 1/n was providing better coverage than the converging rate. (I always found the issue of estimating the asymptotic

Read more »

How many gifts did my true love give to me on all twelve nights…

January 5, 2011

How many gifts did my true love give to me on all twelve nights of Christmas? After seeing Information is Beautiful’s recent information animation, I decided that I’d make my own. I used R to generate a PDF and then did a screencast of the PDF with ffmpeg to make a video with the appropriate timings. The code for the PDF...

Read more »
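
The question in the title can be answered with a couple of lines of R. This is a small sketch, not the code from the post, and it assumes the usual reading of the song in which each day's delivery repeats every earlier gift (so day d brings 1 + 2 + ... + d gifts).

```r
# Gifts received on day d: 1 + 2 + ... + d, because every earlier gift is repeated
gifts_per_day <- cumsum(1:12)

# Total over all twelve days
total_gifts <- sum(gifts_per_day)
total_gifts  # 364
```

The same total follows from the closed form n(n + 1)(n + 2) / 6 with n = 12.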

Customizing the Theme of Your R HTML Help

January 4, 2011

R’s default theme of the HTML help pages is too plain for me to read, but we can easily modify the theme, which is essentially a CSS file. You can find the file under: file.path(R.home('doc'), 'html', 'R.css') Simply replace this file with my version: which looks like: Of course you can design your own R.css

Read more »
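
The replacement step described in the excerpt above could look roughly like this. It is a sketch under assumptions: `install_help_css` is a hypothetical helper name, and the custom stylesheet is any CSS file you supply yourself, not the post author's version.

```r
# Path to the stylesheet used by R's HTML help, as given in the post
css_path <- file.path(R.home('doc'), 'html', 'R.css')

# Hypothetical helper: back up the current stylesheet once, then copy a
# custom one over it. `custom_css` is a path to your own CSS file.
install_help_css <- function(custom_css, target = css_path) {
  stopifnot(file.exists(custom_css))
  backup <- paste0(target, ".bak")
  # Keep the original R.css so the default theme can be restored later
  if (file.exists(target) && !file.exists(backup)) {
    file.copy(target, backup)
  }
  file.copy(custom_css, target, overwrite = TRUE)
}
```

Calling `install_help_css("my-theme.css")` would then overwrite R.css; writing into the R installation directory may require administrator rights, and the `.bak` copy lets you restore the default theme.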

The R Journal: December 2010

January 4, 2011

Issue 2 of The R Journal (the peer-reviewed journal devoted to R) was published over the Christmas break. In addition to news about the latest release of R, it also includes contributed articles on using GPU processing to fit Bayesian models in R, processing text data in R, solving differential equations in R, and much more. Follow the link...

Read more »

Revised market prediction distributions

January 4, 2011

This provides revised plots of the prediction distributions published yesterday. The previous plots of prediction distributions should be ignored; they do not do what was advertised. We show the prediction distribution of levels of several equity indices (plus oil price) at the end of 2011 assuming nothing happens. That is, we’ve taken out market trends …

Read more »

Creating prediction distributions

January 4, 2011

Here we give details and code for the prediction distributions exhibited in yesterday’s blog post Tis the season to predict. Eight years of returns: The equity indices use daily closing levels from the start of 2003. This data comes from Yahoo. A roughly equivalent technique of selecting the last 2000 daily prices is used for …

Read more »