# Monthly Archives: July 2012

## Success does not require understanding

July 23, 2012
I took part in the second Data Science London Hackathon last weekend (also my second hackathon) and it was a very different experience compared to the first hackathon. Once again Carlos and his team really looked after us. The data was released 24 hours before the competition started and even though I had spent less

## How to write a rapport template

July 23, 2012
This post introduces users to producing a template, that is, how to produce results similar to those on rapport's homepage or in our forthcoming reporting web application. The post was written from the view of a Windows user; if problems come up because you use...

## Estimating required hospital bed capacity

July 23, 2012
Estimating required hospital bed capacity requires a thorough analysis. There are a lot of ways of approaching a capacity requirement problem, but I think we can agree that a simple spreadsheet analysis just won't cut it. The approach described in this post makes use of discrete-event simulation and, just to...

## Music Data Hackathon 2012 – Beginner’s view

July 23, 2012
When I first heard of the existence of hackathons (receive a data set, predict the response as well as possible, win money, all within 24 hours), I had two thoughts:

1. Wow, that sounds great. Like a huge game for intelligent people.
2. My skills are no...

## Modeling Trick: Impact Coding of Categorical Variables with Many Levels

July 23, 2012
One of the shortcomings of regression (both linear and logistic) is that it doesn’t handle categorical variables with a very large number of possible values (for example, postal codes). You can get around this, of course, by going to another modeling technique, such as Naive Bayes; however, you lose some of the advantages of regression...

## Computing the degree of dependency (jointness) among explanatory variables using BMS

July 23, 2012
Capturing the dependence between explanatory variables in the posterior distribution while implementing a Bayesian analysis is crucial. Taking such a dependence into account reveals the sensitivity of posterior distributions of parameters to depen...

## Third year wrap-up

July 23, 2012
July marks the end of three years of blogging for us. By our count, we've posted 121 examples across the first three years. We aim to be helpful and interesting. As always, it's hard to get a sense of our readership. At the time we wrote this, Feedbur...

## XLConnect 0.2-0

July 23, 2012
Mirai Solutions GmbH (http://www.mirai-solutions.com) is very pleased to announce the release of XLConnect 0.2-0, which can be found at CRAN. As one of the updates, XLConnect has moved to the newest release of Apache POI: 3.8. Also, the lazy evaluation …

## R Optimization Test

I have tested several R optimization functions before: nlm, optim (Nelder-Mead), optim (BFGS), optim (SANN), nlminb, and optim (L-BFGS-B), for an eight-parameter Vasicek interest rate model. Overall I find that for my setting, nlminb is the best and all R func...

