Blog Archives

Slides and replay of my “Using R with Hadoop” webinar now available #rstats #hadoop

January 25, 2013
By
Slides and replay of my “Using R with Hadoop” webinar now available #rstats #hadoop

I owe a big “thank you” to all of you who attended my webinar yesterday “Using R with Hadoop”. Revolution Analytics partnered with us at Think Big Analytics to produce the webinar, and I owe them thanks as well. For those of you who missed it, the slides and replay are now available from Revolution

Read more »

Slides from “Tapping the Data Deluge with R” lightning talk #rstats #PAWCon

October 2, 2012
By
Slides from “Tapping the Data Deluge with R” lightning talk #rstats #PAWCon

Here is my presentation from last night’s Boston Predictive Analytics Meetup graciously hosted by Predictive Analytics World Boston. The talk is meant to provide an overview of (some) of the different ways to get data into R, especially supplementary data sets to assist with your analysis. All code and data files are available at github:

Read more »

Slides from today’s Big Data Step-by-Step Tutorials: Infrastructure series and Intro to R+Hadoop with RHadoop’s rmr

March 10, 2012
By
Slides from today’s Big Data Step-by-Step Tutorials: Infrastructure series and Intro to R+Hadoop with RHadoop’s rmr

Slides from the Boston Predictive Analytics Big Data Workshop tutorials: Big Data Step-by-Step: Infrastructure 1/3: Local VM Big Data Step-by-Step: Infrastructure 2/3: Running R and RStudio on EC2 Big Data Step-by-Step: Infrastructure 3/3: Taking it to the cloud... easily... with Whirr Big Data Step-by-Step: Using R & Hadoop (with RHadoop's rmr package)

Read more »

Use geom_rect() to add recession bars to your time series plots #rstats #ggplot

August 15, 2011
By
Use geom_rect() to add recession bars to your time series plots #rstats #ggplot

ggplot2's geom_rect() layer makes it easy to highlight portions of your graph, such as recessions on an economic time series.

Read more »

One-liners which make me love R: twitteR’s searchTwitter() #rstats

July 21, 2011
By
One-liners which make me love R: twitteR’s searchTwitter() #rstats

R reminds me a lot of English. It’s easy to get started, but very difficult to master. So for all those times I’ve spent… well, forever… trying to figure out the “R way” of doing something, I’m glad to share these quick wins. My recent R tutorial on mining Twitter for consumer sentiment wouldn’t have

Read more »

One-liners which make me love R: Make your data dance (Hans Rosling style) with googleVis #rstats

July 14, 2011
By
One-liners which make me love R: Make your data dance (Hans Rosling style) with googleVis #rstats

This inaugural post in my "one-liners which make me love R" series highlights the googleVis package which makes it easy to use the Google Visualization API from R. Thanks to googleVis, just one line of R generates the 165 lines of HTML and (mostly) JavaScript required to create a Hans Rosling-style motion chart for some sample data.

Read more »

installing R 2.13.1 on Amazon EC2′s “Amazon Linux” AMI #rstats

July 8, 2011
By
installing R 2.13.1 on Amazon EC2′s “Amazon Linux” AMI #rstats

Condensed from this post (and comments) on David Chudzicki’s blog, tweaked, and updated for R-2.13.1. Assumes you’re starting with a virgin “Amazon Linux” AMI. I picked “Basic 64-bit Amazon Linux AMI 2011.02.1 Beta” (AMI Id: ami-8e1fece7) because it was marked as free tier eligible on the “Quick Start” tab of AWS’s “Launch Instance” dialog box:

Read more »

slides from my R tutorial on Twitter text mining #rstats

July 4, 2011
By
slides from my R tutorial on Twitter text mining #rstats

Update: An expanded version of this tutorial will appear in the new Elsevier book Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications by Gary Miner et. al which is now available for pre-order from Amazon. In conjunction with the book, I have cleaned up the tutorial code and published it on github.

Read more »

googleVis-0.2.4 requires older version of RJSONIO (0.5-0) #rstats

May 29, 2011
By
googleVis-0.2.4 requires older version of RJSONIO (0.5-0) #rstats

Downgrade to RJSONIO 0.5 if you want to be able to set initial states with the current googleVis package. Otherwise, there will be lots of extra backslashes in your JavaScript--and blank pages in your browser.

Read more »

quantmod makes it easy to watch silver prices crash in R #rstats

May 7, 2011
By
quantmod makes it easy to watch silver prices crash in R #rstats

Jeffrey Ryan's quantmod package makes it simple to download and graph pricing data from a variety of sources. A couple of lines of R is all it takes to see that silver has had a very bad week.

Read more »