## upgrade R – F77 cause compilation error

October 20, 2010
By

I try to compile the source code of R 2.12 on CentOS, but it throw an error when trying to install *cluster*. * installing *source* package ‘cluster’ ... Read More: 3800 Words Totally

## Transactions, and Pondering their Use in Casinos

October 20, 2010
By

A couple of weeks ago, Bradford Cross of FlightCaster posted in Measuring Measures that transactions are the next big data category. I argue that they already are, and from reading his blog post, he seems to suggest this as well but I will admit that I think I missed his point. There are some clear examples of transactions and...

## Monitoring Productivity Experiment

October 20, 2010
By

For over a year now, i've been collecting how much time i spend in computer and how much of it is actually used in creative/productive activities. By productive activity i mean that the time spent in text editor(emacs), terminal, excel or a datab...

## Coincidence in lotteries

October 19, 2010
By
$Coincidence in lotteries$

Last weekend, my friend and coauthor Jean-Michel Marin was interviewed (as Jean-Claude Marin, sic!) by a national radio about the probability of the replication of a draw on the Israeli Lottery. Twice the same series of numbers appeared within a month. This lotery operates on a principle of 6/37 + 1/8: 6 numbers are drawn

## Happy World Statistics Day!

October 19, 2010
By

The United Nations has declared today “World Statistics Day”. I’ve no idea what that means, or why we need a WSD. Perhaps it is because the date is 20.10.2010 (except in North America where it is 10.20.2010). But then, what happens from 2013 to 2099? And do we just forget the whole idea after 3112?

## Example 8.10: Combination dotplot/boxplot (teaching graphic in honor of World Statistics Day)

October 19, 2010
By

In honor of World Statistics Day and the read paper that my co-authors Chris Wild, Maxine Pfannkuch, Matt Regan, and I are presenting at the Royal Statistical Society today, we present the R code to generate a combination dotplot/boxplot that is useful...

## EM and Regression Mixture Modeling

October 19, 2010
By

Last night, Drew Conway showed me a fascinating graph that he made from the R package data we’ve recently collected from CRAN. That graph will be posted and described in the near future, because it has some really interesting implications for the structure of the R package world. But for the moment I want to

## Introduction to statistical finance with R

October 19, 2010
By

During the first part of our meeting, Nicolas Christou gave an introduction of statistical finance in R, and presented a package he co-authored with previous PhD student David Diez (2010). Video of the talk is below: During the second part, … Continue reading →

October 19, 2010
By

We have just posted the (mostly definitive) program for Adap’skii, January 3-4, The Canyons, Utah. This is taking place just before and as a satellite of the larger MCMSki III conference, January 4-7, same location. The registration for the conference and for lodging is available through the  MCMCSki III registration page, Remember also that this

## An Old Wives Tale from the 2000 Census

October 19, 2010
By

With the data from the 2010 US Census to be published early next year, here's a cautionary tale from the 2000 Census. If you take a look at the ratio of numbers of men to women in the 5-Percent "PUMS" sample from the 2000 census over various ages, you'll see an odd spike near age 65: What causes this...

## Fast matrix inversion

October 19, 2010
By

Very similar to what has been done to create a function to perform fast multiplication of large matrices using the Strassen algorithm (see previous post), now we write the functions to quickly calculate the inverse of a matrix.To avoid rewriting pages and pages of comments and formulas, as I did for matrix multiplication, this time I'll...

## Fast matrix inversion

October 19, 2010
By

Very similar to what has been done to create a function to perform fast multiplication of large matrices using the Strassen algorithm (see previous post), now we write the functions to quickly calculate the inverse of a matrix.To avoid rewriting pages and pages of comments and formulas, as I did for matrix multiplication, this time I'll...

## Middlesboro Kentucky: Pitch Black?

October 19, 2010
By

In his august draft of Hansen2010, Dr. Hansen makes the following claim: “We present evidence here that the urban warming has little effect on our standard global temperature analysis.  However, in the Appendix we carry out an even more rigorous test. We show there that there are a sufficient number of stations located in “pitch

## Listing gene IDs from hyperGTest

October 19, 2010
By

hyperGTest compute Hypergeomtric p-values for over or under-representation of each GO term in the specified category among the specified gene set. *geneSample* was used as an example. Read More: 1329 Words Totally

## Listing gene IDs from hyperGTest

October 19, 2010
By

hyperGTest compute Hypergeomtric p-values for over or under-representation of each GO term in the specified category among the specified gene set. *geneSample* was used as an example. Read More: 1329 Words Totally

## Sunbelt workshop on SNA in R

October 19, 2010
By

The next Sunbelt Social Networks Conference (XXXI) will take place in February 2011 in St Pete Beach, Florida, USA. The preliminary program is available here. There is still time to submit abstracts (deadline is on October 29, 2010). At the conference I will be teaching the workshop “Introduction to Social Network Analysis with R”. It

## Ideas for World Statistics Day

October 19, 2010
By

World Statistics Day is 2010 October 20.  If you work with data (or you should), then you are a statistician and this is a day for you. Try the Monte Hall problem on your mother. Start reading Bad Science.  I mean the book, but here’s the blog. Take a step towards breaking your spreadsheet addiction … Continue reading...

## NppToR 2.5.0

October 18, 2010
By

NppToR 2.5.0 supports R-2.12.0, improves saved settings and install, also retires the syntax generator.

## Hadley on a Postage Stamp?

October 18, 2010
By

Yet another reason to check out the ggplot2 wiki!

## Hadley on a Postage Stamp?

October 18, 2010
By

Yet another reason to check out the ggplot2 wiki!

## K-Means Redistricting

October 18, 2010
By

U.S. Congressional districts are today drawn with the aim of maximizing the electoral advantage of the state’s majority party, subject to some constraints, including compactness (which can be measured in numerous ways) and a “one person, one vote” standard. What if, instead of minimizing population variance across districts, we aimed to minimize the mean distance between … Continue reading →

## RProtoBuf 0.2.1

October 18, 2010
By

A fresh minor release of RProtoBuf, now at version 0.2.1, has appeared earlier today on CRAN. RProtoBuf provides GNU R bindings for the Google Protobuf data encoding library used and released by Google. This releases extends the recent 0.2.0 re...

## More Graphics with Google earth

October 18, 2010
By

Dr. Paul, R graphics guru, blessed us with his rendition of transparent contour maps drawn on a google earth image: Cool stuff. I’ll be taking his code and turning it into a function and sharing it back: here is the picture his code creates: That is just plain slick.  While I was looking over his

## Cramer’s Stock Pick Recommendations Analyzed (Part II)

October 18, 2010
By

This is the second post (previous one here) that provides an analysis of Cramer's stock recommendations based upon the Mad Money Stock Screener as of 10/15/2010.Recommendations by SegmentAs mentioned in the previous post, ...

## Cramer’s Stock Pick Recommendations Analyzed (Part II)

October 18, 2010
By

This is the second post (previous one here) that provides an analysis of Cramer's stock recommendations based upon the Mad Money Stock Screener as of 10/15/2010.Recommendations by SegmentAs mentioned in the previous post, ...

October 18, 2010
By

## Winners of 2010 ggplot2 case study competition

October 18, 2010
By

The winners of this year's ggplot2 case study competition have been announced. I was honoured to be asked to be a judge of the competition this year, but it was a difficult job with so many excellent entries. In the end, the judging panel (which included Heike Hoffman and Hadley Wickham and me) selected three entries which each demonstrated...

## New Housedata release with October 2010 filings.

October 18, 2010
By

The latest Offensive Politics Housedata file has been released. This file covers 51,463 individual electronic FEC filings for US House committees from 2001 to 10/18/2010.