Blog Archives

Course Profiles in ggplot2

April 23, 2015

Over on my other blog, Droppin’ The Hammer… where I journal my ultra running experience, I featured a novel ggplot2 plot for highlighting elevation gain and loss on running race course profiles. If you’ve never run a particular race you...

Read more »

Blowing Away the Competition

April 22, 2015

In February I embarked on a mission to speed up R, and I’m very pleased with the results so far. I redesigned the internal string cache, symbol table, and environments by using a somewhat obscure data structure called an Array Hash. It’s ba...

Read more »

Hash Table Performance in R: Part IV

April 21, 2015

In the last post I introduced the package envestigate, which provides the hash table structure and interesting statistics associated with an R environment. Now I want to show you some performance characteristics of the R environment as a hash table. I’...

Read more »

Hash Table Performance in R: Part III

April 17, 2015

In Part I of this series, I explained how R hashed environments are superior to vectors or lists when you’re in need of an associative array. In Part II I explained the three main operations you want to perform on hash tables and how you should implement them for optimal performance. This time around...

Read more »

Hash Table Performance in R: Part II

April 13, 2015

In Part I of this series, I explained how R hashed environments are superior to vectors or lists when you need a hash table for your work. I also teased that in this post I would explain the caveats associated with that choice, but I’m saving that for later as I...

Read more »

Hash Table Performance in R: Part I

March 24, 2015

What Is It? A hash table, or associative array, is a well-known key-value data structure. In R there is no equivalent, but you do have some options. You can use a vector of any type, a list, or an environment. But as you’ll see, with all of these options performance is compromised in some way. In the...

Read more »

It’s Time For A Change: A Shiny One

December 3, 2012

I presented rApache to the public for the first time at the Directions in Statistical Computing workshop in August 2005 (paper), almost seven years ago. It might have been novel, maybe even crazy at the time, but I think rApache showed people a new way to bring R to the web. I presented brew, a templating framework for...

Read more »

RMySQL Looking For A New Maintainer

November 19, 2012

Please contact me if you’d like to take over maintainership of this popular R package.

Read more »

Innovation in Statistical Computing

November 15, 2012

In A Capitalist’s Dilemma, Whoever Wins on Tuesday, Clayton Christensen lays out three kinds of innovations through which an industry cycles: Empowering Innovations - those that offer products and services to a new customer base. The classic empowering (or disruptive) innovation is Ford Motor Company’s introduction of the low-cost Model T coupled with the ability of Ford’s own...

Read more »

Deploy Rook Apps: Part II

October 17, 2012

In Part I, I described how you can deploy your Rook applications with rApache. This post describes how you can do it with R itself. But before we get into that, I’d like to explain the off-again on-again relationship Rook has had with CRAN, R’s package archive network. Since inception (of Rook, not the movie), I...

Read more »