Monthly Archives: July 2012

RcppCNPy 0.2.0

July 31, 2012

Version 0.2.0 of the recently introduced RcppCNPy package for reading/writing NumPy data in R arrived on CRAN earlier today. The main changes are the added ability to also write gzip-ed npy files, to suppress an automatic transposition as well as th...


The Environmental Performance Index, visualized with R

July 31, 2012

The Environmental Performance Index (EPI) ranks countries on performance indicators for environmental public health and ecosystem vitality. Yale University hosts the EPI website, which was used to present the 2012 EPI Rankings to world leaders at the 2012 World Economic Forum at Davos. The Country Profiles section of the website allowed members to browse the performance characteristics of their...


Twitter analysis of air pollution in Beijing

July 31, 2012

One of the air pollution detection machines in Beijing (at the American Embassy) is connected to Twitter and tweets about the air quality in real time. By default, the machine in Beijing outputs the 24-hour summary of PM2.5 air pollution information. PM2.5 is defined here. Next will be to compare the...


Fun with geocoding and mapping in JGR

July 31, 2012

For a recent project I had to do some mapping of addresses, but I didn’t have their lat/lons to use with the Deducer and DeducerSpatial packages in R JGR. After frustrating myself trying to adapt this code from stackoverflow.com, I found a much easier way of geocoding using the dismo and XML packages in R. First


Text and symbol size in multi-panel figures in R

July 31, 2012

In R, there are a couple of packages that allow you to create multi-panel figures (see examples here and here), but, of course, you can also make multi-panel figures in the base package*. Below I provide a simple example for creating a multi-panel figure in the R base package with the focus on making the


Edge Prediction in a Social Graph: My Solution to Facebook’s User Recommendation Contest on Kaggle

July 31, 2012

A couple of weeks ago, Facebook launched a link prediction contest on Kaggle, with the goal of recommending missing edges in a social graph. I love investigating social networks, so I dug around a little, and since I did well enough to score one of the coveted prizes, I’ll share my approach here. (For some background, the contest provided...


Application of Horizon Plots

July 31, 2012

For background, please see the prior posts Horizon Plot Already Available and Cubism Horizon Charts in R. Good visualization simplifies, and stories are better told with effective and pretty visualizations. Although horizon plots are not immediately intuitiv...


Multidimensional Scaling and Company Similarity

July 30, 2012

Background and idea: Often we are looking at a particular sector, and want to get a quick overview of a group of companies relative to one another. I thought I might apply Multidimensional Scaling (MDS) to various financial ratios and see if it...


Making R graphics legible in presentation slides

July 30, 2012

I only visited a few JSM sessions today, as I’ve been focused on preparing for my own talk tomorrow morning. However, I went to several talks in a row which all had a common problem that made me cringe: graphics …


Yet Another Forecast Dashboard

July 30, 2012

Recently, I came across quite a few examples of time series forecasting using R. Here are some examples: Time series cross-validation 4: forecasting the S&P 500; Holt-Winters forecast using ggplot2; Autoplot: Graphical Methods with ggplot2; Large-Scale Parallel Statistical Forecasting Computations in R (2011) by M. Stokely, F. Rohani, E. Tassone; Forecasting time series data ARIMA
