## Econometrics and R

August 30, 2010

Econometricians seem to be rather slow to adopt new methods and new technology (compared to other areas of statistics), but slowly the use of R is spreading. I’m now receiving requests for references showing how to use R in econometrics, and so I thought it might be helpful to post a few suggestions here. A

## Hyper-g priors

August 30, 2010

Earlier this month, Daniel Sabanés Bové and Leo Held posted a paper about g-priors on arXiv. While I glanced at it for a few minutes, I did not have the chance to get a proper look at it till last Sunday. The g-prior was first introduced by the late Arnold Zellner for (standard) linear models,

## The Chosen One

August 30, 2010

Toss one hundred different balls into your basket. Shuffle them up and select one with equal probability amongst the balls. That ball you just selected, it’s special. Before you put it back, increase its weight by 1/100th. Then put it back, mix up the balls and pick again. If you do this enough, at some
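The procedure described above can be sketched as a short R simulation. This is only a rough illustration, not the original post's code: it assumes every ball starts with weight 1, that later draws are made with probability proportional to the current weights, and that "1/100th" means adding a fixed 0.01 to the selected ball. The function name `simulate_chosen_one` and its arguments are hypothetical.

```r
# Hypothetical helper: simulate repeated weight-proportional draws.
# Assumptions (not from the original post): all balls start at weight 1,
# and each draw adds a fixed 0.01 to the selected ball's weight.
simulate_chosen_one <- function(n_balls = 100, n_draws = 10000, bump = 1 / 100) {
  weights <- rep(1, n_balls)
  for (i in seq_len(n_draws)) {
    # Draw one ball with probability proportional to its current weight
    pick <- sample.int(n_balls, size = 1, prob = weights)
    weights[pick] <- weights[pick] + bump
  }
  weights
}

set.seed(42)
w <- simulate_chosen_one()

# Share of the total weight held by the heaviest ball
max(w) / sum(w)
```

Raising `n_draws` or `bump` is one way to explore how quickly the weights drift apart under these assumptions.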

## Stochastic Simulation With Copulas in R

August 30, 2010

A friend of mine gave me a call last week and was wondering if I had a little R code that could illustrate how to do a Cholesky decomposition. He ultimately wanted to build a Monte Carlo model with correlated variables. I pointed him to a number of packages that do Cholesky decomp but then
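The Cholesky idea mentioned above can be sketched in a few lines of base R. This is a minimal example under stated assumptions, not the code from the original post: the 2×2 correlation matrix `rho`, the target correlation of 0.7, and the sample size are illustrative choices.

```r
# Target correlation matrix (illustrative values, not from the original post)
rho <- matrix(c(1.0, 0.7,
                0.7, 1.0), nrow = 2)

# chol() returns the upper-triangular factor U with t(U) %*% U equal to rho
U <- chol(rho)

set.seed(1)
n <- 10000
z <- matrix(rnorm(n * 2), ncol = 2)  # independent standard normal draws

# Post-multiplying by U induces the target correlation between the columns
x <- z %*% U

# Sample correlation; should be close to rho for large n
cor(x)
```

Because the columns of `z` are independent with unit variance, the covariance of `z %*% U` is `t(U) %*% U`, which is why the upper-triangular factor goes on the right.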

August 30, 2010

Once you've downloaded PDQ with a view to solving your performance-related questions, the next step is getting started using it. Why not have some fun with blocks? Fun-ctional blocks, that is. Since all digital computers and network systems can be considered as a collection of functional blocks and these blocks often contain buffers, their performance can be modeled...

## Taking R to the Limit: Large Datasets; Predictive modeling with PMML and ADAPA

August 30, 2010

During the first part of our meeting, Ryan Rosario presented on the topic of large datasets in R. Video, slides and code of the talk “Taking R to the Limit: Large Datasets” by Ryan Rosario at the Los Angeles area …

## Sweet bar chart o’ mine

August 30, 2010

Last week I was asked to visualise some heart rate data from an experiment. ... The standard way of displaying a time series (that is, a numeric variable that changes over time) is with a line plot. ... The experimenters, however, wanted a bar chart. I hadn't considered this use of a bar chart before, so it was interesting...

## Example 8.3: pyramid plots

August 30, 2010

Pyramid plots are a common way to display the distribution of age groups in a human population. The percentages of people within a given age category are arranged in a barplot, often back to back. Such displays can be used to distinguish males vs. femal...

## Wanted: R Analysis of New Scientist Covers

August 30, 2010

Peter Aldhous and Jim Giles -- from New Scientist's San Francisco bureau -- are looking for a statistician and R user to take part in an interesting data analysis challenge, and also be part of a future article in the magazine. They were inspired by this rather tongue-in-cheek presentation where Sebastian Wernicke analyzed videos, transcripts and ratings of TED...

## US House Election Results Visualized Five Ways

August 30, 2010

The Democratic major-party vote share of US House elections 2002-2008 visualized 5 different ways.

## Graphing Highly Skewed Data

August 30, 2010

Graphing data with a few outliers is challenging, and some solutions are better than others. Here is a comparison of the alternatives.

## GEO database: curation lagging behind submission?

August 30, 2010

I was reading an old post that describes GEOmetadb, a downloadable database containing metadata from the GEO database. We had a brief discussion in the comments about the growth in GSE records (user-submitted) versus GDS records (curated datasets) over time. Below, some quick and dirty R code to examine the issue, using the Bioconductor GEOmetadb

## MCMC Diagnostics in R with the coda Package

August 29, 2010

This is a follow-up to my recent post introducing the use of JAGS in R through the rjags package. In the comments on that post, Bernd Weiss encouraged me to write a short addendum that describes diagnostic functions that you should use to assess the output from an MCMC sampler. I’ve only been using

## Beta translation done!

August 29, 2010

Once my team of four translators had handed back to me all the chapters of the French version of Introducing Monte Carlo Methods with R, I had to go over the book to ensure some minimal consistency between the chapters. I started the editing in the plane to Vancouver but did not get

## Subset views in R

August 28, 2010

I don’t know how to do this in R. So let me just say why I can’t. I wanted something akin to Boost’s sub-matrix views, where you can have indexes map back to the original matrix, so you don’t create …

## Blegging for Data

August 28, 2010

I’m in the middle of a new project that involves analyzing the packages that are currently on CRAN. As part of my work, I could really benefit from information about which packages are installed on people’s computers. If you’re willing to part with a bit of your time and privacy, I’d very much appreciate you

## Patrick Burns is blogging

August 28, 2010

Patrick Burns is the author of several helpful R resources, including A Guide for the Unwilling S User, The R Inferno, and S Poetry. He also wrote one of my favorite critiques of Microsoft Excel: Spreadsheet Addiction. His writing is witty, entertain...

## Mike’s CNC 2010-08-27 18:36:00

August 27, 2010

Support the OpenGov idea to create a "Platform for number crunchers across (US Federal) government" HERE. A small team is building a small pilot and I'm happy to report that R appears on many of the posts. If you like the idea (or even if you don't),...

## Fractals in R

August 27, 2010

Atte Tenkanen had a blog on fractals using R for a time. Much of his source code is still available online. To produce his version of the Mandelbrot set: `source('http://users.utu.fi/attenka/mandelbrot_set.R')` Fractals (such...

## Because it’s Friday: How Machines Work

August 27, 2010

Ever wondered how a sewing machine seemingly manages to knot stitches without ever releasing the thread? Well, wonder no more: Find this and other animations of marvels of engineering, including the universal velocity joint and the rotary engine, at the link below. World Of Technology: Complicated Mechanisms Explained in simple animations (via) http://mytechnologyworld9.blogspot.com/2010/08/complicated-mechanisms-explained-in.html

## Poll: Half of SAS users considering a switch

August 27, 2010

A recent poll of KDnuggets readers suggests that of those using SAS today, almost half (49.6%) are considering switching to a different system for statistical analysis. The poll was prompted by the recent high court decision in the UK that affirmed that "WPS is lawful clone of SAS system" (as stated in a WPS press release). The exact...

## Matlab-style multiple assignment in R

August 26, 2010

R again! You know how in Matlab you can do `[S, I] = sort(M)`? I like that. R generic functions make this possible. First, let’s genericize assignment. I feel like regular “=” and “<-” oughta stay nongeneric, so let’s make …

## ProjectTemplate

August 26, 2010

Introduction As many people already know, I’ve recently uploaded a new R package called ProjectTemplate to GitHub and CRAN. The ProjectTemplate package provides a function, create.project(), that automatically builds a directory for a new R project with a clean sub-directory structure and automatic data and library loading tools. My hope is that standardized data loading,

## Oh (de)bugger!

August 26, 2010

By number of questions asked, R passed MATLAB for the first time on Stack Overflow today. Thus it seems an appropriate time to write my first R-based post. This post concerns what to do when your R code goes pear-shaped. Back in June there were a couple of very good videos on R debugging that