## What’s for lunch? Private browsing.

August 23, 2010

Over at the Mozilla Metrics blog, Mozillan Hamilton Ulmer uses R and ggplot2 to look at when people (or at least, Firefox users who volunteered to share their usage data) enable private browsing. It turns out it isn't just "porn mode" after all: the main use is lunchtime browsing, away from the employer's prying eyes. Follow the...

## Tips for the R beginner (a 5 page overview)

August 23, 2010

In this post I publish a PDF document titled “A collection of tips for R in Finance”. It is a basic 5-page introduction to R in finance by Arnaud Amsellem (linked in profile). The article offers tips related to the following points: Code Editor; Organizing R code; Update packages; Getting external data into R; Communicating with external applications...

## Taking R to the Limit: Parallelism and Big Data

August 23, 2010

In a two-part series at the Los Angeles R User Group, Ryan Rosario took a look at the many ways you can take the R language to the limits of high-performance computing. In Part I (see video at this link; slides and code also available), Ryan focuses on the various methods of parallel computing in R. There's some great...

## Leveraging the Wisdom of Crowds for Fantasy Football

August 23, 2010

WARNING: This has nothing to do with national security, but is nonetheless awesome. This evening I will be participating in that great annual tradition which marks the transition from Summer to Fall: the fantasy football draft. A large part of having a successful fantasy football draft is being able to adjudicate the value of a player more accurately

## TripleR round-up

August 23, 2010

GSoC 2010 is over – here’s the harvest from my project: TripleR 0.4.3 is the current stable version – and it is a major milestone in its development. Now for the first time social relations round robin designs can be analyzed in R. All results have been cross-checked with TripleR’s DOS predecessor SOREMO.exe, and all

## Tools for Hacking R: Subversion

August 23, 2010

The development version of R is stored in a Subversion repository at the URL http://svn.r-project.org/R/trunk/. In fact, you can browse the source code by clicking the link. Subversion Hierarchy: Subversion is software for source code revision control. That means it keeps track of changes, who made them, when they were made, and any comments about

## Abstract word clouds using R

August 23, 2010

A recent question over at BioStar asked whether abstracts returned from a PubMed search could easily be visualised as “word clouds”, using Wordle. This got me thinking about ways to solve the problem using R. Here’s my first attempt, which demonstrates some functions from the RCurl and XML packages. Update: corrected a couple of copy/paste

## A small and lonely sea urchin…

August 22, 2010

A few weeks ago, a paper on which I am a co-author was accepted for publication in the French ecological journal Life & Environment. In this paper, we evaluate the consequences of recreational harvesting on three populations of sea urchins (…)

## Newcomb, Benford, and their Dirty, Dirty Logarithms

August 22, 2010

Tom Taverner introduced me to Benford’s Law as we were eating lunch together at a statistical computing conference: If you look at the first digits of data in many naturally occurring datasets, a startling 30 percent of them are ones. “Pah!” I said. “That belies intuition! Why would one digit occur any more than another? I’d
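The "30 percent" figure in the teaser above can be checked with a short sketch. This is not code from the linked post: it assumes the standard statement of Benford's Law, under which the leading digit d appears with probability log10(1 + 1/d), and it uses the powers of 2 as an illustrative dataset. The function names are hypothetical.

```python
import math
from collections import Counter


def benford_probs():
    """Expected share of each leading digit 1-9 under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(n):
    """First decimal digit of a positive integer."""
    return int(str(n)[0])


def leading_digit_shares(values):
    """Observed share of each leading digit 1-9 in a sequence of positive integers."""
    counts = Counter(leading_digit(v) for v in values)
    total = sum(counts.values())
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


# Illustrative dataset (an assumption, not from the post): 2**1 .. 2**1000.
expected = benford_probs()
observed = leading_digit_shares(2 ** n for n in range(1, 1001))

for d in range(1, 10):
    print(d, round(expected[d], 3), round(observed[d], 3))
```

The expected share for the digit 1 is log10(2), about 0.301, which is where the "30 percent" comes from. The powers of 2 are only one convenient sequence for comparison.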

## Traffic prediction contest closing soon

August 22, 2010

A quick reminder that the IEEE traffic-prediction competition closes soon. If you're thinking of entering you'll need to get the description of your R-based solution in by September 13. IEEE ICDM Contest: Road Traffic Prediction for Intelligent GPS Navigation

## Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

August 22, 2010

Update: fixed projection. There are a bunch of “hockey sticks” that calculate past global temperatures through the use of proxies when instrumental data is absent. There is a new one out there by McShane and Wyner (2010) that’s creating quite a stir in the blogosphere (here, here, here, here). The main takeaway being that

## Dump R datasets into a single file

August 21, 2010

Should you need the datasets that come with R and additional packages (you can access them via data()) in a single file, here’s what I did to dump the entire workspace into one file: This code can easily be adapted to dump each dataset into its own file.

## Using R for Introductory Statistics, Chapter 3.4

August 21, 2010

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression: Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

## Map of Upcoming Ruby Conferences

August 21, 2010

One of the top searches on rubyflow is “conference”. A recent post showed how to create a map with the location of the 2010 R User Conference. So why not expand on the subject and create a map with numerous conference locations thr...

## Managing Market Studies in R

August 21, 2010

I'm currently working on seasonal studies for various markets and have decided it's high time I got an organized workflow established. How does sugar behave in August every year? Is it random or are there some fundamental drivers that coerce its behavi...

## swing graph

August 21, 2010

I’m updating a swing dotplot PDF every 10 minutes as the count progresses (and the cool part is that the updates continue even as I’m flying Heathrow to SFO).

## Weekend art in R (Part 3)

August 21, 2010

I have a few posts nearing completion, but meanwhile a weekend break for art. Big thanks to Simon Urbanek and Jeffrey Horner, creators of Cairo, a library for the programming language R. Have you noticed how R can’t anti-alias (a fancy way of saying smooth out lines and curves when creating a bit-mapped image)? Cairo can.

## Using JAGS in R with the rjags Package

August 20, 2010

Get Everything Set Up: I’m going to assume that you have access to a machine that will run JAGS. If you don’t, then you should be able to use WinBUGS, which is very easy to get set up. Unfortunately, the details of what follows may not help you as much if you’re using WinBUGS. To

## Automatic Differentiation in R

August 20, 2010

Project outcomes: radx (forward automatic differentiation in R) and tada (templated automatic differentiation in C++). Development summary: During the summer of 2010, under the Google Summer of Code program, I was assigned the project of implementing an engine for Automatic Differentiation in R. The implementation involved building a fully functional system for computing numerical

## Taking R to the Limit, Part II – Large Datasets in R

August 20, 2010

For Part I, Parallelism in R, click here. Tuesday night I again had the opportunity to present on high performance computing in R, at the Los Angeles R Users’ Group. This was the second part of a two part series called “Taking R to the Limit: High Performance Computing in R.” Part II discussed ways to work with large datasets...

## How extreme is the Russian heatwave?

August 20, 2010

The devastating heatwave in Russia now seems to be over, but not before killing thousands, causing extensive wildfires, and destroying crops. But how severe was this heatwave, compared to past summers? Physicist and climate scientist Joe Wheatley looks at the record of temperature and rainfall in Russia over the last 60 years and places the last 3 months in...

## Phylogenetic trees online

August 20, 2010

The other day, an article was published in PLoS One describing a newly developed JavaScript library to visualise phylogenetic trees online: jsPhyloSVG. It's pretty nifty, and there's some pretty cool functionality that you can build into the trees. It's all based on the PhyloXML standard for describing phylogenetic trees and networks, but can display trees...

## R courses from Statistics.com

August 19, 2010

The fine folks at Statistics.com have a number of courses related to R coming up in the next few months, including what looks to be a very useful course in handling data with R from none other than R Core Team member Paul Murrell. The courses from Statistics.com are on-line, so you can participate on your own schedule and...

## Speeding up parentheses (and lots more) in R

August 19, 2010

As I noted here, enclosing sub-expressions in parentheses is slower in R than enclosing them in curly brackets. I now know why, and I’ve modified R to reduce (but not eliminate) the slowness of parentheses. The modification speeds up many other operations in R as well, for an average speedup of something like 5% for

## A brief introduction to “apply” in R

August 19, 2010

At any R Q&A site, you’ll frequently see an exchange like this one: Q: How can I use a loop to ...? A: Don’t. Use one of the apply functions. So, what are these wondrous apply functions and how do they work? I think the best way to figure out anything in

## R/Rmetrics at BaselR

August 19, 2010

August 18, 2010

Last year there were over 2,600 murders in Ciudad Juarez, and if the more than 1,800 murders so far this year are any indications, there will be even more murders in 2010. Ciudad Juarez is a scary place, but it wasn't always that way... I learned from Noel Maurer's Blog that Ciudad Juarez used to have a low murder...