Monthly Archives: April 2010

Because it’s Friday: Chatroulette

April 2, 2010

Yesterday, Drew Conway posted an analysis of the survival time to events on Chatroulette. If you're familiar with Chatroulette, you'll know what kind of events you can expect to occur when using it. (If you're not, here's a hint: don't try it now if you're at work.) Sadly, it was all an April Fool's Day joke. But Drew takes...

Read more »

A free book on Geostatistical Mapping with R

April 2, 2010

Tomislav Hengl of the University of Amsterdam has published a new book, A Practical Guide to Geostatistical Mapping. It's jam-packed with 291 pages on mapping and analyzing spatial data using free software, including R, SAGA, GRASS, ILWIS and Google Earth, and freely available map data. The book itself is also available for free, as an Open Access Publication. You can order...

Read more »

How to Produce Fake Data Analysis in R: 3 Easy Steps

April 2, 2010

Did you really think that a team of researchers spent their weekends counting the number of shirtless adolescent men and exposed penises they could find on chatroulette.com? Perhaps you should not answer that, as it may be a better measure of your opinion of sociologists than of your gullibility. It is true, sociologists do say the...

Read more »

CLT Standard Normal Generator

April 2, 2010

I’ve found this standard normal random number generator in a number of places, one of them being one of Paul Wilmott’s books. The idea is that we can use the Central Limit Theorem (CLT) to easily generate values distributed according to a standard normal distribution by using the sum of 12 uniform random...
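The teaser above is cut off, so the following is only a minimal sketch of the usual form of this trick, not the post's own code. It assumes the 12 draws are independent Uniform(0, 1) values. Their sum then has mean 6 and variance 12 × 1/12 = 1, so subtracting 6 gives an approximately standard normal value. The sketch is in Python, and the function names `clt_standard_normal` and `clt_sample` are made up for illustration.

```python
import random
import statistics


def clt_standard_normal(rng=random):
    """Approximate one N(0, 1) draw: sum of 12 Uniform(0, 1) draws, minus 6.

    Assumption: the uniforms are independent Uniform(0, 1); the shift by 6
    centers the sum, whose variance is already 12 * (1/12) = 1.
    """
    return sum(rng.random() for _ in range(12)) - 6.0


def clt_sample(n, seed=None):
    """Return a list of n approximate standard normal draws (hypothetical helper)."""
    rng = random.Random(seed)
    return [clt_standard_normal(rng) for _ in range(n)]


if __name__ == "__main__":
    draws = clt_sample(100_000, seed=42)
    # The sample mean should be near 0 and the standard deviation near 1.
    print(statistics.mean(draws), statistics.pstdev(draws))
```

One limitation is worth keeping in mind: every value produced this way lies in [-6, 6], so the tails are lighter than those of a true normal distribution.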

Read more »

Lookup Performance in R

April 2, 2010

Rumor has it that Joe Adler, author of the O’Reilly book R in a Nutshell, has joined LinkedIn as a data scientist. But that has not kept him from pumping out some interesting content over at OReilly.com. His latest article is about lookup performance in R. He does a great job giving code...

Read more »

Opening Statements on Markov Chain Monte Carlo

April 1, 2010

This quarter I am TAing UCLA’s Statistics 102C, Introduction to Monte Carlo Methods, for Professor Qing Zhou. This course did not exist when I was an undergraduate, and I think it is pretty rare to teach Monte Carlo (minus the bootstrap, if you count that) or MCMC to undergrads. I am excited about this class because to me, MCMC...

Read more »

Frank Harrell’s Regression Modeling Strategies Course Handouts

April 1, 2010

The previously mentioned Regression Modeling Strategies short course taught by Frank Harrell is nearly over. Here are the handouts (PDF) from the course. Keep an eye out here: I'll be writing a few more posts in the near future on topics Frank covered...

Read more »

Quantile LOESS – Combining a moving quantile window with LOESS (R function)

April 1, 2010

In this post I will provide R code that implements the combination of a repeated running quantile with the LOESS smoother to create a type of “quantile LOESS” (e.g., “Local Quantile Regression”). This method is useful when the need arises to fit a robust and resistant (needs to be verified) smoothed line for a quantile (an example for such a...

Read more »

Because it’s Thursday: Epidemiology of the Undead

April 1, 2010

Noted statistician Andrew Gelman has teamed up with occultist George Romero to address the most serious public-health threat of our time: Zombies. They've published a paper in the journal Biomastika, "How many zombies do you know?", to propose the use of indirect survey methods to measure outbreaks of the undead. Abstract: The zombie menace has so far been studied...

Read more »

Plots in R and the ImageJ visualization

April 1, 2010

If you plot data in R and would like to display the same data in the ImageJ view, it is necessary to transfer the data matrix to ImageJ. The first thing you may notice is that the image data is displayed rotated because of the Bio7 approach to transferring data back and forth...

Read more »