## Data Referenced Journalism and the Media – Still a Long Way to Go Yet?

November 4, 2011
Reading our local weekly press this evening (the Isle of Wight County Press), I noticed a page 5 headline declaring “Alarm over death rates at St Mary’s”, St Mary’s being the local general hospital. It seems a Department of Health report on hospital mortality rates came out earlier this week, and the Island’s hospital, it

## Confidence interval for predictions with GLMs

November 4, 2011
Consider a (simple) Poisson regression. Given a sample, the goal is to derive a 95% confidence interval for the expected response at a given value of the covariate, where the point estimate is the prediction. Hence, we want a confidence interval for the prediction, not for a potential observation, i.e. the dot on the graph below

    r=glm(dist~speed,data=cars,family=poisson)
    P=predict(r,type="response",
              newdata=data.frame(speed=seq(-1,35,by=.2)))
    plot(cars,xlim=c(0,31),ylim=c(0,170))
    abline(v=30,lty=2)

...
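One common way to get such an interval, sketched here under stated assumptions and not necessarily the approach the post goes on to take, is to build a normal-approximation interval on the link (log) scale and transform it back with `exp()`. The names `fit`, `new_x`, `eta`, `se_eta`, `point` and `ci` are illustrative; the model and the value `speed = 30` come from the excerpt above.

```r
# Assumption: a Wald-type interval on the log-link scale, back-transformed.
# This is a sketch of one standard technique, not the post's own derivation.

# Same Poisson regression as in the excerpt, on the built-in cars data
fit <- glm(dist ~ speed, data = cars, family = poisson)

# Point at which the prediction is wanted (the dashed line at speed = 30)
new_x <- data.frame(speed = 30)

# Linear predictor and its standard error on the link scale
pred <- predict(fit, newdata = new_x, type = "link", se.fit = TRUE)
eta <- unname(pred$fit)
se_eta <- unname(pred$se.fit)

# 95% interval on the link scale, mapped back to the response scale
z <- qnorm(0.975)
point <- exp(eta)
ci <- exp(eta + c(-1, 1) * z * se_eta)

print(point)
print(ci)
```

Working on the link scale keeps both bounds positive, which an interval built directly on the response scale does not guarantee.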

## Help: stemming and stem completion with package tm in R

November 3, 2011
I ran into the problem described below when doing stemming and stem completion with package tm in R. The word “mining” was stemmed to “mine” with stemDocument(), and then completed to “miners” with stemCompletion(). However, I would prefer to keep “mining” intact. For stemCompletion(), …

## Maximizing Omega Ratio

November 3, 2011
The Omega Ratio was introduced by Keating and Shadwick in 2002. It measures the ratio of average portfolio wins over average portfolio losses for a given target return L. Let x.i, i = 1,…,n, be the weights of the instruments in the portfolio. We suppose that j = 1,…,T scenarios of returns with equal probabilities are available. I will
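As a minimal sketch of the definition above: with T equally likely scenarios, the average win is the mean of the portfolio returns in excess of L (floored at zero) and the average loss is the mean shortfall below L. The function name `omega_ratio`, its arguments and the simulated returns are assumptions made for illustration, and the sketch only evaluates the ratio for fixed weights rather than maximizing it.

```r
# Assumption: scenario_returns is a T x n matrix (one row per scenario,
# one column per instrument) and all scenarios are equally likely.
omega_ratio <- function(weights, scenario_returns, target) {
  # Portfolio return in each scenario
  portfolio <- as.vector(scenario_returns %*% weights)
  # Average gain above the target and average shortfall below it
  avg_win <- mean(pmax(portfolio - target, 0))
  avg_loss <- mean(pmax(target - portfolio, 0))
  avg_win / avg_loss
}

# Illustrative data only: 200 simulated scenarios for 3 instruments
set.seed(1)
returns <- matrix(rnorm(200 * 3, mean = 0.01, sd = 0.05), nrow = 200, ncol = 3)
w <- rep(1 / 3, 3)

omega <- omega_ratio(w, returns, target = 0)
print(omega)
```

The ratio is undefined when no scenario falls below the target, since the average loss is then zero.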

## Code Optimization: One R Problem, Ten Solutions – Now Eleven!

November 2, 2011
Earlier this year I came across a rather interesting page about optimisation in R from rwiki. The goal was to find the most efficient code to produce strings which follow the pattern below given a single integer input n: From this we can see that the general pattern for n is: It is rather heart

## "Applications of R" contest submissions online

November 2, 2011
Thanks to everyone for participating in the "Applications of R in Business" contest. R users submitted more than 25 entries, describing how R is used in industries including life sciences, finance, manufacturing, sentiment analysis, and even sports. Some entries are just outlines for now (competitors have until November 30 to finalize their entries), but already there are some quite...

## Cycles in finite populations: A reproducible seminar in three acts

November 1, 2011
For this year's Halloween I presented the mathematical biology seminar at the Centre for Mathematical Biology. Here is the title and the abstract… Cycles in finite populations: a reproducible seminar in three acts Many natural populations exhibit cyclic fluctuations. Explaining the underlying …

## Selecting statistics for ABC model choice [R code]

November 1, 2011
As supplementary material to the ABC paper we just arXived, here is the R code I used to produce the Bayes factor comparisons between summary statistics in the normal versus Laplace example. (Warning: running the R code takes a while!)

## Use case: combining taxize and rgbif

November 1, 2011
Sure thing… this is just the sort of thing for which rOpenSci is being built. A colleague of mine recently saw our packages in development and thought, “Hey, that could totally make my life easier.” What was made easier, you ask? This was his situation: He had a list of ca. 1200 species of

