Blog Archives

New R titles available in Chinese

August 26, 2015

by Andrie de Vries A few days ago I received an unexpected parcel in my letterbox. To my delight, it turned out to be a translation into Simplified Chinese of R for Dummies, co-authored by myself and Joris Meys. Let me clarify: Joris and I wrote the book, but were not involved in the translation at all. The Wiley...

Differences in the network structure of CRAN and BioConductor

August 12, 2015

by Andrie de Vries This week at JSM2015, the annual conference of the American Statistical Association, Joseph Rickert and I gave a presentation on the topic of "The network structure of CRAN and BioConductor" (link to abstract). Our work tested the hypothesis that one can detect statistical differences in the network graph formed by the dependencies between packages. In...

Contracting and simplifying a network graph

August 5, 2015

by Andrie de Vries In a previous post, I used page rank and community structure to create a plot of CRAN. This plot used vibrant colours to allow us to see some of the underlying structure of CRAN. However, much of this structure was still obfuscated by the amount of detail. Concretely, a large number of dots (packages) made...

The most popular programming languages on StackOverflow

July 29, 2015

by Andrie de Vries Last week, IEEE Spectrum said R rose to #6 in Top Programming Languages. They use a weighted methodology of 12 factors to compute their score. Among these factors is the activity on social programming websites, including StackOverflow and GitHub. I recently used data.stackexchange.com to query the total number of questions on StackOverflow using the R...

The network structure of CRAN

July 8, 2015

by Andrie de Vries My experience of UseR!2015 drew to an end shortly after I gave a Kaleidoscope presentation discussing "The Network Structure of CRAN". My talk drew heavily on two previous blog posts, Finding the essential R packages using the pagerank algorithm and Finding clusters of CRAN packages using igraph. However, in this talk I went further, attempting...

News from UseR!2015 – the RHadoop tutorial

July 1, 2015

by Andrie de Vries Today is the first day of the UseR!2015 conference in Aalborg in Northern Denmark. But yesterday was a day packed with 16 tutorials on a range of interesting topics. I submitted a proposal many months ago to run a session on using R in Hadoop and was very happy to be selected to run a session in...

Creating progress bars with foreach parallel processing

March 10, 2015

by Andrie de Vries In my previous post, I demonstrated how to get some status of running jobs on a parallel back end. However, I stopped short of actually demonstrating progress bars. In this post I demonstrate how to do this. The StackOverflow question How do you create a progress bar when using the “foreach()” function in R?...

Monitoring progress of a foreach parallel job

February 24, 2015

by Andrie de Vries R has strong support for parallel programming, both in base R and additional CRAN packages. For example, we have previously written about foreach and parallel programming in the articles Tutorial: Parallel programming with foreach and Intro to Parallel Random Number Generation with RevoScaleR. The foreach package provides simple looping constructs in R, similar to lapply()...

An update to the checkpoint package

February 18, 2015

by Andrie de Vries In October 2014 we announced RRT (the Reproducible R Toolkit), which consists of the checkpoint package and MRAN. In January, David Smith followed up with another post about reproducibility using Revolution R Open. Since then, we've had several requests for new features and enhancements. The development code for checkpoint is available on GitHub. The...

How the MKL speeds up Revolution R Open

October 22, 2014

by Andrie de Vries Last week we announced the availability of Revolution R Open, an enhanced distribution of R. One of the enhancements is the inclusion of high performance linear algebra libraries, specifically the Intel MKL. This library significantly speeds up many statistical calculations, e.g. the matrix algebra that forms the basis of many statistical algorithms. Several years ago,...
