2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

July 10, 2012
The Tenth Australasian Data Mining Conference (AusDM 2012), Sydney, Australia, 5-7 December 2012. http://ausdm12.togaware.com/ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. This year’s conference, AusDM’12, co-hosted …

Sourcing Code from GitHub

July 10, 2012
In previous posts I described how to input data stored on GitHub directly into R. You can do the same thing with source code stored on GitHub. Hadley Wickham has actually made the whole process easier by combining the getURL, textConnection, and source commands into one function: source_url. This is in his devtools...

Fitting a dynamic model, and determining the number of parameters that can be fitted.

July 8, 2012
Let's suppose that we have the same dynamic model we presented before - that is, the Lorenz system of differential equations. Remember? In order to perform a fit we need to define an objective function of sorts, which will then be minimised. Now,...

The Actuary Puzzle 508 – Square numbers

Author: Matt Malin. From the puzzle pages of The Actuary, June 2012, I attempt to solve the following, making use of R: This square contains exactly 21 smaller squares. Each of these smaller squares has sides of integer length, with no two smaller squares having sides of the same length. Can you find a solution for...

July 6, 2012
Error metrics for multi-class problems in R: beyond Accuracy and Kappa

July 6, 2012
The caret package for R provides a variety of error metrics for regression models and 2-class classification models, but only calculates Accuracy and Kappa for multi-class models.  Therefore, I wrote the following function to allow caret:::train t...

Fix Overplotting with Colored Contour Lines

July 6, 2012
I saw this plot in the supplement of a recent paper comparing microarray results to RNA-seq results. Nothing earth-shattering in the paper - you've probably seen a similar comparison many times before - but I liked how they solved the overplotting...

More Exploration of Crazy RUT

July 5, 2012
While playing with the lawstat package in R, I unintentionally started trying to build systems (STANDARD DISCLAIMER: NOT INVESTMENT ADVICE AND WILL LOSE LOTS OF MONEY SO PROCEED WITH CAUTION) based on the Jarque-Bera test of normality (entry in Wikiped...