Blog Archives

Creating progress bars with foreach parallel processing

March 10, 2015
By
Creating progress bars with foreach parallel processing

by Andrie de Vries In my previous post, I demonstrated how to get some status of running jobs on a parallel back end. However, I stopped short of actually demonstrating progress bars. In this post I demonstrate how to do this. The StackOverflow question How do you create a progress bar when using the “foreach()” function in R? ()...

Read more »

Monitoring progress of a foreach parallel job

February 24, 2015
By
Monitoring progress of a foreach parallel job

by Andrie de Vries R has strong support for parallel programming, both in base R and additional CRAN packages. For example, we have previously written about foreach and parallel programming in the articles Tutorial: Parallel programming with foreach and Intro to Parallel Random Number Generation with RevoScaleR. The foreach package provides simple looping constructs in R, similar to lapply()...

Read more »

An update to the checkpoint package

February 18, 2015
By

by Andrie de Vries During October 2014 we announced RRT (the Reproducible R Toolkit) that consists of the checkpoint package and the MRAN. In January, David Smith followed up with another post about reproducibility using Revolution R Open. Since then, we've had several requests for new features and enhancements. The development code for checkpoint is available at GitHub. The...

Read more »

How the MKL speeds up Revolution R Open

October 22, 2014
By
How the MKL speeds up Revolution R Open

by Andrie de Vries Last week we announced the availability of Revolution R Open, an enhanced distribution of R. One of the enhancements is the inclusion of high performance linear algebra libraries, specifically the Intel MKL. This library significantly speeds up many statistical calculations, e.g. the matrix algebra that forms the basis of many statistical algorithms. Several years ago,...

Read more »

Introducing the Reproducible R Toolkit and the checkpoint package

October 13, 2014
By
Introducing the Reproducible R Toolkit and the checkpoint package

The ability to create reproducible research is an important topic for many users of R. So important, that several groups in the R community have tackled this problem. Notably, packrat from RStudio, and gRAN from Genentech (see our previous blog post). The Reproducible R Toolkit is a new open-source initiative from Revolution Analytics. It takes a simple approach to...

Read more »

Introducing miniCRAN: an R package to create a private CRAN repository

October 3, 2014
By

by Andrie deVries One of the reasons that R is so popular is the CRAN archive of useful packages. However, with more than 5,900 packages on CRAN, many organisations need to maintain a private mirror of CRAN with only a subset of packages that are relevant to them. The package miniCRAN makes this possible by determining the dependency tree...

Read more »

Revisiting package dependencies

July 29, 2014
By
Revisiting package dependencies

by Andrie de Vries In my previous post I wrote about how to identify and visualize package dependencies. Within hours, Duncan Murdoch (member of R-core) identified some discrepancies between my list of dependencies and the visualisation. Since then, I fixed the dispecrancies. In this blog post I attempt to clarify the issues involved in listing package dependencies. In miniCRAN...

Read more »

Dependencies of popular R packages

July 8, 2014
By
Dependencies of popular R packages

With the growing popularity of R, there is an associated increase in the popularity of online forums to ask questions. One of the most popular sites is StackOverflow, where more than 60 thousand questions have been asked and tagged to be related to R. On the same page, you can also find related tags. Among the top 15 tags...

Read more »

Reading data from the new version of Google Spreadsheets

June 3, 2014
By
Reading data from the new version of Google Spreadsheets

Spreadsheets remain an important way for people to share and work with data. Among other providers, Google has provided the ability to create online spreadsheets and other documents. Back in 2009, David Smith posted a blog entry on how to use R, and specifically the XML package to import data from a Google Spreadsheet. Once you marked your Google...

Read more »

Using survival models for marketing attribution

July 23, 2013
By

by Andrie de Vries Prior to joining Revolution Analytics in March this year, I spent several years in the field of market research and survey analytics. During this period, I spent a few months consulting to a digital marketing agency based in London. My role was to help build their capability in building customer surveys and integrating these into...

Read more »