Blog Archives

Data mining with R course in the Netherlands taught by Luis Torgo

January 29, 2014
By

In the course of this year, Dr. Luis Torgo will teach a Data Mining with R course together with the DIKW Academy in Nieuwegein, The Netherlands. Dr. Torgo is an Associate Professor at the department of Computer Science at the… See more ›

Read more »

Vectorisation is your best friend: replacing many elements in a character vector

January 25, 2014
By

As with any programming language, R allows you to tackle the same problem in many different ways or styles. These styles differ both in the amount of code, readability, and speed. In this post I want to illustrate this by… See more ›

Read more »

The performance of dplyr blows plyr out of the water

January 22, 2014
By

Together with many other packages written by Hadley Wickham, plyr is a package that I use a lot for data processing. The syntax is clean, and it works great for breaking down larger data.frame‘s into smaller summaries. The greatest disadvantage… See more ›

Read more »

Bubble sorting in R, C++ and Julia: code improvements and the R compiler

December 28, 2013
By

In the past few months I have written posts about implementing the bubble sort algorithm in different languages. In the mean while I have gotten some feedback and suggestions regarding improvements to the implementation I made, see the end of… See more ›

Read more »

Parallel processing with short jobs only increases the run time

December 27, 2013
By
Parallel processing with short jobs only increases the run time

Parallel processing has become much more important over the years as multi-core processors have become common place. From version 02.14 onwards, parallel processing has become part of the standard R installation in the form of the parallel package. This package… See more ›

Read more »

Julia is lightning fast: bubble sort revisited

December 26, 2013
By

I had heard the name of the new technical computing language Julia buzzing around for some time already. Now during Christmas I had some time on my hands, and implemented the bubble sort algorithm that I have already posted about… See more ›

Read more »

Much more efficient bubble sort in R using the Rcpp and inline packages

May 14, 2013
By

Recently I wrote a blogpost showing the implementation of a simple bubble sort algorithm in pure R code. The downside of that implementation was that is was awfully slow. And by slow, I mean really slow, as in “a 100… See more ›

Read more »

Bubble sort implemented in pure R

May 10, 2013
By
Bubble sort implemented in pure R

Please note that this is programming I purely did for the learning experience. The pure R bubble sort implemented in this post is veeeeery slow for two reasons: Interpreted code with lots of iteration is very slow. Bubble sort is… See more ›

Read more »

Parsing complex text files using regular expressions and vectorization

March 24, 2013
By

When text data is in a nice CSV format, read.csv is enough to parse it into a useable format. But if this is not the case, getting the data into a useable format is not so straightforward. In this post… See more ›

Read more »

Automatic spatial interpolation with R: the automap package

February 17, 2013
By
Automatic spatial interpolation with R: the automap package

In case of continuously collected data, e.g. observations from a monitoring network, spatial interpolation of this data cannot be done manually. Instead, the interpolation should be done automatically. To achieve this goal, I developed the automap package. automap builds on… See more ›

Read more »