We’ve already briefly covered making a reproducible example, but I thought I’d go a little more in-depth on the very helpful function dput(). Whenever I am having trouble applying an operation to some R object, and need help, but want to av...

We’ve already briefly covered making a reproducible example, but I thought I’d go a little more in-depth on the very helpful function dput(). Whenever I am having trouble applying an operation to some R object, and need help, but want to av...

This guest post is by Alex Guazzelli, VP of Analytics at Zementis Inc. -- ed. PMML, the Predictive Model Markup Language, is the de facto standard to represent predictive analytics and data mining models. With PMML, it is extremely easy to move a predictive solution from one system to another, since it avoids proprietary issues and incompatibilities. Companies around...

The classic Pythagorean identity is: \(sin^2(\theta) + cos^2(\theta) =1 \) The binomial formula which calculates the probability of obtaining k tails when flipping a coin n times, with a assumed probability p for each trial is: \( P(E) = {n \choos...

I have been exploring how to speed up some of my R scripts and have started reading about some amazing corners of R. My first weapon was the Rcpp and RcppArmadillo package. These are wonderful tools and even for someone that has never written c++ before, there are enough to examples and documentation to get started. I...

At last weekend’s Hack Ta Ville event here in Montreal, I joined up with some talented urban planners and web devs to realize Vélobstacles. The idea of the project is to crowd source information on cycling conditions around the city. As with any crowd sourcing project, we were faced with the problem of seeding the