Blog Archives

An update to the checkpoint package

February 18, 2015
By

by Andrie de Vries During October 2014 we announced RRT (the Reproducible R Toolkit) that consists of the checkpoint package and the MRAN. In January, David Smith followed up with another post about reproducibility using Revolution R Open. Since then, we've had several requests for new features and enhancements. The development code for checkpoint is available at GitHub. The...

Read more »

How the MKL speeds up Revolution R Open

October 22, 2014
By
How the MKL speeds up Revolution R Open

by Andrie de Vries Last week we announced the availability of Revolution R Open, an enhanced distribution of R. One of the enhancements is the inclusion of high performance linear algebra libraries, specifically the Intel MKL. This library significantly speeds up many statistical calculations, e.g. the matrix algebra that forms the basis of many statistical algorithms. Several years ago,...

Read more »

Introducing the Reproducible R Toolkit and the checkpoint package

October 13, 2014
By
Introducing the Reproducible R Toolkit and the checkpoint package

The ability to create reproducible research is an important topic for many users of R. So important, that several groups in the R community have tackled this problem. Notably, packrat from RStudio, and gRAN from Genentech (see our previous blog post). The Reproducible R Toolkit is a new open-source initiative from Revolution Analytics. It takes a simple approach to...

Read more »

Introducing miniCRAN: an R package to create a private CRAN repository

October 3, 2014
By

by Andrie deVries One of the reasons that R is so popular is the CRAN archive of useful packages. However, with more than 5,900 packages on CRAN, many organisations need to maintain a private mirror of CRAN with only a subset of packages that are relevant to them. The package miniCRAN makes this possible by determining the dependency tree...

Read more »

Revisiting package dependencies

July 29, 2014
By
Revisiting package dependencies

by Andrie de Vries In my previous post I wrote about how to identify and visualize package dependencies. Within hours, Duncan Murdoch (member of R-core) identified some discrepancies between my list of dependencies and the visualisation. Since then, I fixed the dispecrancies. In this blog post I attempt to clarify the issues involved in listing package dependencies. In miniCRAN...

Read more »

Dependencies of popular R packages

July 8, 2014
By
Dependencies of popular R packages

With the growing popularity of R, there is an associated increase in the popularity of online forums to ask questions. One of the most popular sites is StackOverflow, where more than 60 thousand questions have been asked and tagged to be related to R. On the same page, you can also find related tags. Among the top 15 tags...

Read more »

Reading data from the new version of Google Spreadsheets

June 3, 2014
By
Reading data from the new version of Google Spreadsheets

Spreadsheets remain an important way for people to share and work with data. Among other providers, Google has provided the ability to create online spreadsheets and other documents. Back in 2009, David Smith posted a blog entry on how to use R, and specifically the XML package to import data from a Google Spreadsheet. Once you marked your Google...

Read more »

Using survival models for marketing attribution

July 23, 2013
By

by Andrie de Vries Prior to joining Revolution Analytics in March this year, I spent several years in the field of market research and survey analytics. During this period, I spent a few months consulting to a digital marketing agency based in London. My role was to help build their capability in building customer surveys and integrating these into...

Read more »