What’s for lunch? Private browsing.

August 23, 2010
By
What’s for lunch? Private browsing.

Over at the Mozilla Metrics blog, Mozillan Hamilton Ulmer uses R and ggplot2 to look at when people (or at least, Firefox users that volunteered to share their usage data) enable private browsing. Turns out it isn't just "porn mode" after all: the main use turns out to be lunchtime browsing away from the employer's prying eyes: Follow the...

Read more »

Tips for the R beginner (a 5 page overview)

August 23, 2010
By

In this post I publish a PDF document titled “A collection of tips for R in Finance”. It is a basic 5 page introduction to R in finances by Arnaud Amsellem (linked in profile). The article offers tips related to the following points: Code Editor Organizing R code Update packages Getting external data into R Communicating with external applications...

Read more »

Taking R to the Limit: Parallelism and Big Data

August 23, 2010
By

In a two-part series at the Los Angeles R User Group, Ryan Rosario took a look at the many ways you can take the R language to the limits of high-performance computing. In Part I (see video at this link; slides and code also available), Ryan focuses on the various methods of parallel computing in R. There's some great...

Read more »

Leveraging the Wisdom of Crowds for Fantasy Football

August 23, 2010
By
Leveraging the Wisdom of Crowds for Fantasy Football

WARNING: This has nothing to do with national security, but is nonetheless awesome. This evening I will be participating in that great annual tradition which marks the transition from Summer to Fall: the fantasy football draft. A large part of having a successful fantasy football draft is being able to adjudicate the value of a player more accurately

Read more »

TripleR round-up

August 23, 2010
By
TripleR round-up

GSoC 2010 is over – here’s the harvest from my project: TripleR 0.4.3 is the current stable version – and it is a major milestone in its development. Now for the first time social relations round robin designs can be analyzed in R. All results have been cross-checked with TripleR’s DOS predecessor SOREMO.exe, and all

Read more »

Tools for Hacking R: Subversion

August 23, 2010
By

The development version of R is stored in a Subversion repository at the URL http://svn.r-project.org/R/trunk/. In fact, you can browse the source code by clicking the link. Subversion Hierarchy Subversion is software for source code revision control. That means it keeps track of changes, who made them, when they were made, and any comments about

Read more »

Abstract word clouds using R

August 23, 2010
By
Abstract word clouds using R

A recent question over at BioStar asked whether abstracts returned from a PubMed search could easily be visualised as “word clouds”, using Wordle. This got me thinking about ways to solve the problem using R. Here’s my first attempt, which demonstrates some functions from the RCurl and XML packages. update: corrected a couple of copy/paste

Read more »

A small and lonely sea urchin…

August 22, 2010
By
A small and lonely sea urchin…

A few weeks ago, a paper on which I am a co-author was accepted for publication in the french ecological journal Life & Environment. In this paper, we evaluate the consequences of recreative harvesting on three populations of sea urchins (…)Read the rest of this entry »

Read more »

Newcomb, Benford, and their Dirty, Dirty Logarithms

August 22, 2010
By
Newcomb, Benford, and their Dirty, Dirty Logarithms

Tom Taverner introduced me to Benford’s Law as we were eating lunch together at a statistical computing conference: If you look at the first digits of data in many naturally-occuring datasets, a startling 30 percent of them are ones. “Pah!” I said. “That belies intuition! Why would one digit occur any more than another? I’d

Read more »

Traffic prediction contest closing soon

August 22, 2010
By

A quick reminder that the IEEE traffic-prediction competition closes soon. If you're thinking of entering you'll need to get the description of your R-based solution in by September 13. IEEE ICDM Contest: Road Traffic Prediction for Intelligent GPS Navigation

Read more »

Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

August 22, 2010
By
Global Temperature Proxy Reconstructions ~ Bayesian extrapolation of warming w/ rjags

Update: fixed projection. There are a bunch of “hockey sticks” that calculate past global temps. through the use of proxies when instrumental data is absent. There is a new one out there by McShane and Wyner (2010) that’s creating quite a stir in the blogosphere (here, here, here, here). The main take out being, that

Read more »

Dump R datasets into a single file

August 21, 2010
By
Dump R datasets into a single file

Should you need datasets that come with R and additional packages (you can access them via data()) in one single file, here’s what I did to dump the entire workspace into one file: This code can easily be adapted to dump individual dataset into its own file.

Read more »

Using R for Introductory Statistics, Chapter 3.4

August 21, 2010
By
Using R for Introductory Statistics, Chapter 3.4

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

Read more »

Using R for Introductory Statistics, Chapter 3.4

August 21, 2010
By
Using R for Introductory Statistics, Chapter 3.4

...a continuing journey through Using R for Introductory Statistics, by John Verzani. Simple linear regression Linear regression is a kooky term for fitting a line to some data. This odd bit of terminology can be blamed on Sir Francis Galton, a proli...

Read more »

Map of Upcoming Ruby Conferences

August 21, 2010
By
Map of Upcoming Ruby Conferences

One of the top searches on rubyflow is “conference”.  A recent post showed how to create a map with the location of the 2010 R User Conference.  So why not expand on the subject and create a map with numerous conference locations thr...

Read more »

Managing Market Studies in R

August 21, 2010
By
Managing Market Studies in R

I'm currently working on seasonal studies for various markets and have decided it's high time I got an organized workflow established. How does sugar behave in August every year? Is it random or are there some fundamental drivers that coerce its behavi...

Read more »

swing graph

August 21, 2010
By

I’m updating a swing dotplot PDF every 10 minutes as the count progresses (and the cool part is that the updates continue even as I’m flying Heathrow to SFO).

Read more »

Weekend art in R (Part 3)

August 21, 2010
By
Weekend art in R (Part 3)

I have a few posts nearing completion, but meanwhile a weekend break for art. Big thanks to Simon Urbanek and Jeffrey Horner, creators of Cairo, a library for the programming language R. Have you noticed how R can’t anti-alias (fancy way for saying smooth out lines and curves when creating a bit-mapped image)? Cairo can.

Read more »

Using JAGS in R with the rjags Package

August 20, 2010
By

Get Everything Set Up I’m going to assume that you have access to a machine that will run JAGS. If you don’t, then you should be able to use WinBUGS, which is very easy to get set up. Unfortunately, the details of what follows may not help you as much if you’re using WinBUGS. To

Read more »

Automatic Differentiation in R

August 20, 2010
By
Automatic Differentiation in R

project outcomes —————- radx: forward automatic differentiation in R tada: templated automatic differentiation in C++ development summary ——————- During the summer of 2010, under the Google Summer of Code program, I was assigned the project of implementing an engine for Automatic Differentiation in R. The implementation involved building a fully functional system for computing numerical

Read more »

Taking R to the Limit, Part II – Large Datasets in R

August 20, 2010
By
Taking R to the Limit, Part II – Large Datasets in R

For Part I, Parallelism in R, click here. Tuesday night I again had the opportunity to present on high performance computing in R, at the Los Angeles R Users’ Group. This was the second part of a two part series called “Taking R to the Limit: High Performance Computing in R.” Part II discussed ways to work with large datasets...

Read more »

How extreme is the Russian heatwave?

August 20, 2010
By
How extreme is the Russian heatwave?

The devastating heatwave in Russia now seems to be over, but not before killing thousands, causing extensive wildfires, and destroying crops. But how severe was this heatwave, compared to past summers? Physicist and climate scientist Joe Wheatley looks at the record of temperature and rainfall in Russia over the last 60 years and places the last 3 months in...

Read more »

Phylogenetic trees online

August 20, 2010
By

The other day, an article was published in PLoS One describing a newly developed JavaScript library to visualise phylogenetic trees online: jsPhyloSVG. It's pretty nifty, and there's some pretty cool functionality that you can build into the trees. It's all based on the PhyloXML standard for describing phylogenetic trees and networks, but can display trees...

Read more »

Phylogenetic trees online

August 20, 2010
By

The other day, an article was published in PLoS One describing a newly developed JavaScript library to visualise phylogenetic trees online: jsPhyloSVG. It's pretty nifty, and there's some pretty cool functionality that you can build into the trees. It's all based on the PhyloXML standard for describing phylogenetic trees and networks, but can display trees...

Read more »

R courses from Statistics.com

August 19, 2010
By

The fine folks at Statistics.com have a number of courses related to R coming up in the next few months, including what looks to be a very useful course in handling data with R from none other than R Core Team member Paul Murrell. The courses from Statistics.com are on-line, so you can participate on your own schedule and...

Read more »

Speeding up parentheses (and lots more) in R

August 19, 2010
By
Speeding up parentheses (and lots more) in R

As I noted here, enclosing sub-expressions in parentheses is slower in R than enclosing them in curly brackets. I now know why, and I’ve modified R to reduce (but not eliminate) the slowness of parentheses. The modification speeds up many other operations in R as well, for an average speedup of something like 5% for

Read more »

A brief introduction to “apply” in R

August 19, 2010
By
A brief introduction to “apply” in R

At any R Q&A site, you’ll frequently see an exchange like this one: Q: How can I use a loop to ? A: Don’t. Use one of the apply functions. So, what are these wondrous apply functions and how do they work? I think the best way to figure out anything in

Read more »

R/Rmetrics at BaselR

August 19, 2010
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web...

Read more »

Myths about Ciudad Juarez

August 18, 2010
By
Myths about Ciudad Juarez

Last year there were over 2,600 murders in Ciudad Juarez, and if the more than 1,800 murders so far this year are any indications, there will be even more murders in 2010. Ciudad Juarez is a scary place, but it wasn't always that way... I learned from Noel Maurer's Blog that Ciudad Juarez used to have a low murder...

Read more »