Mideast Graph 3: Slate Middle East Friendship

July 23, 2014

Slate recently published a great infographic about Middle East relationships. It classifies the relationship (Friend, Enemy, or Complicated) between each pair of 13 countries and organizations. One drawback of the chart is that it doe...

Read more »

Making random draws from an arbitrarily defined pdf

July 23, 2014

I recently found myself in need of a function to sample randomly from an arbitrarily defined probability density function. An excellent post by Quantitations shows how to accomplish this using some of R's fairly sophisticated functional approximation to...

Read more »
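
As a rough illustration of the idea in the teaser above, one common way to draw from an arbitrary density is inverse transform sampling on a grid. This is not necessarily the approach taken in the linked posts, whose details are cut off in the excerpt. The helper name sample_from_pdf and its arguments are made up for this sketch, and it assumes the density is finite and non-negative on a bounded interval.

```r
# Hypothetical helper (not from the original post): draw n samples from an
# unnormalised density `pdf` on [lower, upper] by inverting a tabulated CDF.
sample_from_pdf <- function(n, pdf, lower, upper, grid_size = 1000) {
  x <- seq(lower, upper, length.out = grid_size)
  dens <- pdf(x)
  stopifnot(all(is.finite(dens)), all(dens >= 0), sum(dens) > 0)
  # Trapezoidal rule: cumulative area under the density up to each grid point.
  area <- c(0, cumsum(diff(x) * (head(dens, -1) + tail(dens, -1)) / 2))
  cdf <- area / area[length(area)]
  # Drop flat stretches so the inverse CDF is a well-defined function.
  keep <- c(TRUE, diff(cdf) > 0)
  inverse_cdf <- approxfun(cdf[keep], x[keep])
  # Uniform draws pushed through the inverse CDF follow the target density.
  inverse_cdf(runif(n))
}

set.seed(1)
# Example: density proportional to x^2 on [0, 1].
draws <- sample_from_pdf(10000, function(x) x^2, lower = 0, upper = 1)
```

Because the density is normalised numerically, the function passed in only needs to be proportional to the target pdf. A finer grid (larger grid_size) reduces the interpolation error at the cost of more evaluations.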

Flood fill a region of an active device in R

July 23, 2014

The following is a function to "flood fill" a region on the active plotting device. Once called, the user will be asked to click on the desired target region. The flood fill algorithm then searches neighbors in 4 directions of the target cell (down, le...

Read more »
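
The function in the post works on the active plotting device and takes its start point from a mouse click. As a simplified stand-in, the sketch below applies the same 4-neighbour idea to a plain matrix. The name flood_fill and its arguments are assumptions for illustration only, not the author's code.

```r
# Hypothetical helper (not the post's function): 4-neighbour flood fill on a
# matrix, starting at (row, col) and replacing the connected region's value.
flood_fill <- function(m, row, col, new_value) {
  target <- m[row, col]
  if (target == new_value) return(m)  # nothing to do; also avoids looping forever
  stack <- list(c(row, col))
  # Offsets for the four axis-aligned neighbours.
  steps <- list(c(1, 0), c(0, -1), c(-1, 0), c(0, 1))
  while (length(stack) > 0) {
    cell <- stack[[length(stack)]]   # pop the most recently added cell
    stack[[length(stack)]] <- NULL
    r <- cell[1]
    k <- cell[2]
    if (r < 1 || r > nrow(m) || k < 1 || k > ncol(m)) next  # off the grid
    if (m[r, k] != target) next      # boundary or already filled
    m[r, k] <- new_value
    for (s in steps) stack[[length(stack) + 1]] <- c(r + s[1], k + s[2])
  }
  m
}

grid <- matrix(c(0, 0, 1, 0,
                 0, 1, 1, 0,
                 1, 1, 0, 0), nrow = 3, byrow = TRUE)
filled <- flood_fill(grid, 1, 1, 2)
```

An explicit stack is used instead of recursion so that large regions do not run into R's nesting limit.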

Uncovering the Preferences Shaping Consumer Data: Matrix Factorization

July 23, 2014

How do you limit your search when looking for a hotel? Those trying to save money begin with price. Members of hotel reward programs focus on their brand. At other times, location is first to narrow our consideration set. What does hotel search re...

Read more »

New data packages

July 23, 2014

I’ve released four new data packages to CRAN: babynames, fueleconomy, nasaweather and nycflights13. The goal of these packages is to provide some interesting, and relatively large, datasets to demonstrate various data analysis challenges in R. The package source code (on GitHub, linked above) is fully reproducible so that you can see some data tidying in

Read more »

Plotting the characteristic roots for ARIMA models

July 23, 2014

When modelling data with ARIMA models, it is sometimes useful to plot the inverse characteristic roots. The following functions will compute and plot the inverse roots for any fitted ARIMA model (including seasonal models). # Compute AR roots arroots <- function(object) { if(class(object) != "Arima" & class(object) != "ar") stop("object must be of class Arima or ar") if(class(object) ==...

Read more »
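
The post's own functions are cut off in the excerpt above. As a minimal sketch of the underlying calculation, assuming a non-seasonal model fitted with stats::arima(), the inverse AR roots can be obtained with polyroot(). The helper names inverse_ar_roots and plot_inverse_roots are invented for this example and are not the author's.

```r
# Hypothetical helper: inverse AR roots of a non-seasonal stats::arima() fit.
inverse_ar_roots <- function(fit) {
  coefs <- coef(fit)
  # Keep only the non-seasonal AR coefficients (named ar1, ar2, ...).
  phi <- coefs[grepl("^ar[0-9]+$", names(coefs))]
  if (length(phi) == 0) return(complex(0))
  # AR polynomial 1 - phi_1 z - ... - phi_p z^p; invert its roots.
  1 / polyroot(c(1, -phi))
}

# Hypothetical helper: show the inverse roots against the unit circle.
plot_inverse_roots <- function(roots) {
  theta <- seq(0, 2 * pi, length.out = 200)
  plot(cos(theta), sin(theta), type = "l", asp = 1,
       xlab = "Real", ylab = "Imaginary", main = "Inverse AR roots")
  points(Re(roots), Im(roots), pch = 19)
}

set.seed(42)
y <- arima.sim(model = list(ar = c(0.5, -0.3)), n = 500)
fit <- arima(y, order = c(2, 0, 0))
roots <- inverse_ar_roots(fit)
```

Calling plot_inverse_roots(roots) draws the picture. For a stationary fit, every point should lie inside the unit circle.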

Advanced R Profiling with pbdPAPI

July 22, 2014

R has some extremely useful utilities for profiling, such as system.time(), Rprof(), the often overlooked tracemem(), and the rbenchmark package. But if you want more than just simple timings of code execution, you will mostly have to look elsewhere. One of the best sources for profiling data is hardware performance counters, available in most modern hardware. This data can...

Read more »

Chart from R + Color from Javascript

July 22, 2014

Another color experiment combining resources from R and JavaScript. I just wish I could do Mean Phylogenetic Distance in JavaScript like rPlotter. I enjoyed using d3.js zoom behavior to pan and zoom the image on canvas. Also, filedrop.js made the...

Read more »

Rcpp11 3.1.1

July 22, 2014

R 3.1.1 was released a few days ago, and as part of the policy we are trying to follow for Rcpp11 releases, here is Rcpp11 3.1.1. Sorry for the 12-day delay, but I was away in California, and Rcpp11 travelled with me, so I could not properly test the ...

Read more »

Announcing Packrat v0.4

July 22, 2014

We’re excited to announce a new release of Packrat, a tool for making R projects more isolated and reproducible by managing their package dependencies. This release brings a number of exciting features to Packrat that significantly improve the user experience: Automatic snapshots ensure that new packages installed in your project library are automatically tracked by

Read more »

Intermission: A Data File For Futures Data (from Quandl)

July 22, 2014

So between variations of different strategies, for those who have yet to come across it, my IKTrading package has a …

Read more »

Comrades Marathon: A Race for Geriatrics?

July 22, 2014

It has been suggested that the average Comrades Marathon runner is gradually getting older. As an “average runner” myself, I will not deny that I am personally getting older. But, what I really mean is that the average age of all runners taking part in this great event is gradually increasing. This is not just

Read more »

How to Choose an R-Trainer?

July 22, 2014

By James Peruvankal. There are plenty of options if you want to learn R and are looking for training: your college’s statistics department, massive open online courses like Coursera, Udacity, edX, DataCamp, etc. SiliconANGLE recently published an article about top R-training companies. Let’s talk about how to choose a good R-trainer. First and foremost is technical competency in R...

Read more »

Rcpp11 t-shirt sightseeing

July 22, 2014

While we wait for CRAN to process the release of Rcpp11 3.1.1, here is some news about the Rcpp11 t-shirt. With more than 40 attendees, the Rcpp11 tutorial was a success. I managed to distribute almost all the t-shirts I printed for the occasion. I onl...

Read more »

Things I Forget: Push/Pull Greyed out in RStudio?!?

July 22, 2014

So, on more than one occasion I have set up a repository locally, then on GitHub and pushed to that repo from the shell. This works great, but this would always result in the Push and Pull buttons in RStudio getting greyed out. I could push just fine from the shell, but not from the

Read more »

Introducing tidyr

July 22, 2014

tidyr is a new package that makes it easy to “tidy” your data. Tidy data is data that’s easy to work with: it’s easy to munge (with dplyr), visualise (with ggplot2 or ggvis) and model (with R’s hundreds of modelling packages). The two most important properties of tidy data are: Each column is a variable. Each

Read more »

Note to self: brew cleanup r

July 22, 2014

Note to self: after updating R with Homebrew, remember to clean up old versions with brew cleanup r; otherwise I'm liable to get a segfault.

Read more »

EARL Conference, London, 15-17 September 2014

July 22, 2014

EARL (Effective Applications of the R Language) is a unique conference dedicated to the growing commercial usage and business applications of R. We currently have fewer than 100 delegate places left and fewer than 10 places for each of the pre-conference workshops. For full details of the conference and to purchase your tickets online, please visit www.earl-conference.com or contact [email protected] (Academic and group discounts...

Read more »

A Handy Trick for Remote Graphics

July 22, 2014

I often create plots that require quite a bit of computation. Ideally I would run this on what I’ll call Machine A, which is a very fast machine, but I am often far away, on Machine B. So, I’d like to run my computation on A but display it on B. For the platforms I

Read more »

Notes from the 2nd R in Insurance Conference

July 22, 2014

The 2nd R in Insurance conference took place last Monday, 14 July, at Cass Business School, London. This one-day conference focused once more on applications in insurance and actuarial science that use R. Topics covered included reserving, pricing, loss...

Read more »

There’s no mistake in the barley data

July 21, 2014

Statistics has many canonical data sets. For classification statistics, we have Fisher's iris data. For Big Data statistics, the canonical data set used in many examples is the Airlines data. And for dotplots, we have the barley data, first popularized by Bill Cleveland in the landmark 1993 text Visualizing Data. Cleveland's innovations in data visualization were hugely influential...

Read more »

UseR 2014, days 3-4

July 21, 2014

Three weeks ago, I’d commented on the first two days of the UseR 2014 conference. I’m finally back to talk about the second half. Dirk Eddelbuettel on Rcpp: Dirk Eddelbuettel gave a keynote on Rcpp. The goal of Rcpp is to have “the speed of C++ with the ease and clarity of R.” He

Read more »

Geomorph update 2.1.1 now available!

July 21, 2014

Geomorph users, we have uploaded version 2.1.1 to CRAN. The Windows and Mac binaries have been compiled and the tarball is available. Version 2.1.1 contains small updates and fixes a few small bugs. New feature: specimens can now be rotated...

Read more »

Master interactive documents at the Shiny Dev Center

July 21, 2014

We’ve added a new section of articles to the Shiny Development Center. These articles explain how to create interactive documents with Shiny and R Markdown. You’ll learn how to Use R Markdown to create reproducible, dynamic reports. R Markdown offers one of the most efficient workflows for writing up your R results. Create interactive documents

Read more »

Income distribution and Tour de France

July 21, 2014

A few days ago, Jean-François Mignot published an interesting article entitled Tour de France 2014 : pourquoi le vainqueur gagne 100 fois plus que le 10e. In this article, we have the following graph, with the income of the cyclist as a function of his final ranking (the data were downloaded from http://sportbuzzbusiness.fr/) > bike=read.csv( + "http://freakonometrics.free.fr/tourdefrance.csv", + sep=";",header=TRUE,dec=" ") >...

Read more »

How to update and back up a MySQL database under version control, all within RStudio

July 21, 2014

I am trying to build better workflows to ensure data quality. Two things are important to me: first, scripting as much of the data manipulation process as possible, and second, backing up the database we use under version control (e.g. …

Read more »

Changes in rnoaa v0.2.0

July 21, 2014

We just released v0.2 of rnoaa. For details on the update, see the release notes. What follows are some notes on the more important changes. Updating to v0.2 Install rnoaa from CRAN install.packages("rnoaa") or Github devtools::install_github("ropensci/rnoaa") Then load rnoaa library("rnoaa") UI changes We changed almost all function names to have a more intuitive programmatic user interface (or UI). We changed all noaa*() functions to ncdc*() - these work...

Read more »

Weekly Variability Simulation of Fantasy Football Projections

July 20, 2014

In this post, I show how to estimate players’ week-to-week variability in fantasy football points. In a prior post, I demonstrated how to calculate a player’s risk level, as defined...

Read more »

Graphing football world cup shots with rCharts

July 20, 2014

So, the FIFA World Cup is over. Germany won. Yay! Now, with so many fewer football matches to watch, I tried the awesome new R package rCharts by Ramnath Vaidyanathan. In this case, I used the capability to generate NVD3 charts. Check it out... You ca...

Read more »