Blog Archives

Rforecastio Package Update (1.1.0)

May 4, 2014

I’ve bumped up the version number of Rforecastio (github) to 1.1.0. The new features are:

- removing the SSL certificate bypass check (it doesn’t need it anymore)
- using plyr for easier conversion of JSON->data frame
- adding in a new daily forecast data frame
- roxygen2 inline documentation

library(Rforecastio) library(ggplot2) library(plyr)   # NEVER put API keys in

Read more »

Moving From system() calls to Rcpp Interfaces

April 23, 2014

Over on the Data Driven Security Blog there’s a post on how to use Rcpp to interface with an external library (in this case ldns for DNS lookups). It builds on another post which uses system() to call dig to look up DNS TXT records. The core code is below and at both

Read more »

Mapping the March 2014 California Earthquake with ggmap

April 1, 2014

I had no intention to blog this, but @jayjacobs convinced me otherwise. I was curious about the recent (end of March, 2014) California earthquake “storm” and did a quick plot for “fun” and personal use using ggmap/ggplot. I used data from the Southern California Earthquake Center (that I cleaned up a bit and that you

Read more »

Guardian Words: Visualized

March 15, 2014

Andy Kirk (@visualisingdata) & Lynn Cherny (@arnicas) tweeted about the Guardian Word Count service/archive site, lamenting the lack of visualizations:

“Want to know num of words written in each day's Guardian paper by section + approx reading time? http://t.co/wP4W1EzUsx via @bengoldacre”
Andy Kirk (@visualisingdata), March 15, 2014

This gave me a chance to bust out

Read more »

Using Twitter as a Data Source For Monitoring Password Dumps

February 20, 2014

I shot a quick post over at the Data Driven Security blog explaining how to separate Twitter data gathering from R code via the Ruby t (github repo) command. Using t frees R code from having to be a Twitter processor and lets the analyst focus on analysis and visualization, plus you can use t

Read more »

One More (Yet-another?) Olympic Medal Live-tracking Shiny App

February 12, 2014

I’m posting this mostly to show how to:

- use the Google spreadsheet data-munging “hack” from the previous post in a Shiny context
- include it seamlessly into a web page, and
- run it locally without a great deal of wrangling

The code for the app is in this gist. It is unsurprisingly just like some spiffy

Read more »

Live Google Spreadsheet For Keeping Track Of Sochi Medals

February 11, 2014

The “medals” R post by TRInker and its re-blog by Revolutions were both spiffy and a live example of why there’s no point in not publishing raw data. You don’t need to have R (or any other language) do the scraping, though. The “IMPORTHTML” function (yes, function names seem to be ALL CAPS now over at Google

Read more »

Data Driven Security Roundup: betaPERT, Shiny, Honeypots, Passwords & Reproducible Research

February 9, 2014

Jay Jacobs (@jayjacobs)—my co-author of the soon-to-be-released book Data-Driven Security—& I have been hard at work over at the book’s sister-blog cranking out code to help security domain experts delve into the dark art of data science. We’ve covered quite a bit of ground since January 1st, but I’m using this post to focus more

Read more »

Lies, Damn Lies, “Data Journalism” and Charts That Don’t Start at 0

January 28, 2014

This tweet by @moorehn (who usually is a superb economic journalist) really bugged me:

“Alarming chart of employment for people between 25 and 54. It's like a ski jump. #SOTUecon pic.twitter.com/KNGYmwI88C”
Heidi N. Moore (@moorehn), January 29, 2014

I grabbed the raw data from EPI: (http://www.epi.org/files/2012/data-swa/jobs-data/Employment%20to%20population%20ratio%20(EPOPs).xls) and properly started the graph at 0 for the

Read more »

Change The Default “Shell…” Action In RStudio for OS X

January 10, 2014

RStudio is my R development environment of choice and I work primarily on/in Mac OS X. While it’s great that Apple provides a built-in Terminal application, I prefer to use iTerm 2 when I need to do work at a shell. The fine folks at RStudio provide a handy Shell… menu item off the Tools

Read more »