## (Unsurprisingly) users default to the defaults

September 19, 2012
Oddities tend to jump out when one uses software on a daily basis. The situation is even clearer when using software for teaching: many more people are looking at it with fresh eyes. Let's say that we are fitting a simple

## Vertnet – getting vertebrate museum record data and a quick map

September 19, 2012
We (rOpenSci) started a repo to wrap the API for VertNet, an open access online database of vertebrate specimen records across many collection holders. Find the open source code here - please contribute if you are so inclined. We had a great Google Su...

## Automatic drug utilization reports with R and ggplot2

September 18, 2012
This program takes a data set of drug utilisation of 4 fictional drugs in 10 fictional hospitals and plots each time-series with a locally weighted regression (Lowess) trend line. It also places a time-series trend of the usage for each

## DESeq vs edgeR Comparison

September 18, 2012
Update (Dec 18, 2012): Please see this related post I wrote about differential isoform expression analysis with Cuffdiff 2. DESeq and edgeR are two methods and R packages for analyzing quantitative readouts (in the form of counts) from high-throughput e...

## Creating Posters using R, Latex, Beamer and Beamerposter

September 18, 2012
A while ago I needed to produce some posters that included lots of data (scientific style). Having recently got back into R and learning LaTeX, I googled for a way to do this using R. Here's what I found and ended up with, using R, LaT...

## Embedding $$\LaTeX$$ in Tumblr

September 18, 2012
The classic Pythagorean identity is: $$\sin^2(\theta) + \cos^2(\theta) = 1$$ The binomial formula, which calculates the probability of obtaining k tails when flipping a coin n times, with an assumed probability p for each trial, is: \( P(E)   = {n \choos...

## Getting data from figures in published papers

September 18, 2012
The problem: There are a lot of figures in published papers in the scholarly literature, like the one below, from (Attwood et al. 2012): At some point, a scientist wants to ask a question for which they can synthesize the knowledge on that question b...

## Permanent Portfolio

September 17, 2012
First, just a quick update: I’m moving the release date of the SIT package a few months down the road, probably in November. Now back to the post. Recently I came across a series of interesting posts about the Permanent Portfolio at the GestaltU blog. Today I want to show you how to back-test the

## What is Tony talking about?

September 17, 2012
I first experimented with word clouds several years ago and used them to visualise the speeches of Kevin Rudd and Malcolm Turnbull. I have now learned from the Fell Stats blog (via R-Bloggers) that there is an R package for generating word clouds.  The package makes use of tm, a text mining package for R, which I have been

## Olympic predictions – from an R web service provider’s point of view

September 17, 2012
Hello, world! Back in July we read Markus Gesmann’s great blog post about a prediction for the 100m final in London. Soon we decided to create similar estimates about the forthcoming events and started to post our results on Facebook. We would like to emphasise again that these kinds of extrapolated estimates are rather just for fun and we also think...