## ISBA towards higher computing goals [yet another new section!!!]

September 19, 2012
Surrounding the great and exciting gathering of Bayesian statisticians in Kyoto last June, several ISBA sections have appeared in the past weeks, as already mentioned on the ‘Og. Along with Anto Mira and Nicolas Chopin (who did most of the organisational work while I was wandering down under!), we discussed a Bayesian computation section

## 2012-5 Voronoi Treemaps in R

September 19, 2012
A Voronoi Treemap is a visualisation tool for displaying data with a hierarchical structure.  This article describes an open source implementation of Voronoi Treemaps in the R language and environment for statistical computing and graphics.  The article includes example usage … Continue reading →

## FinancialInstrument Moves to CRAN

September 19, 2012
I thought I would break up the posts about GSOC (no, I’m not done yet – there are a few more to do) with a quick note about FinancialInstrument. The FinancialInstrument package provides a construct for defining and storing meta-data for tradable contracts (referred to as instruments, e.g., stocks, futures, options, etc.). The package can

## Why are pirates called pirates?

September 19, 2012
In homage to International Talk Like a Pirate Day… I recently stumbled across a series of blog posts from the folks at IDV that visualised the archive of recorded pirate attacks which has been collected by the US National Geospatial-Intelligence Agency. It’s a dataset of 6000+ pirate attacks which have been recorded over the last 30 … Continue reading...

## The PMML Revolution: Predictive analytics at the speed of business

September 19, 2012
This guest post is by Alex Guazzelli, VP of Analytics at Zementis Inc. -- ed. PMML, the Predictive Model Markup Language, is the de facto standard to represent predictive analytics and data mining models. With PMML, it is extremely easy to move a predictive solution from one system to another, since it avoids proprietary issues and incompatibilities. Companies around...

## Stock Market and US elections

September 19, 2012
By Fotis Papailias and Dimitrios Thomakos | Quantitative Finance, Technical Trading & Analysis. We made a very simple R file that gathers historical data for the periods before and after the US elections. The inexperienced user has the ability to set the tickers of the assets she wants to study and the look-back and look-forward...

## SPSS and R day in York

September 19, 2012
On 2 November there are some introductory health-focussed training sessions on SPSS and R software going on in York: Two sets of two half-day parallel workshops and two tutorial sessions are provisionally planned to be held at the Alcuin Research … Continue reading →

## (Unsurprisingly) users default to the defaults

September 19, 2012
Oddities tend to jump out when one uses software on a daily basis. The situation is even clearer when using software for teaching: many more people are looking at it with fresh eyes. Let’s say that we are fitting a simple … Continue reading →

## Vertnet – getting vertebrate museum record data and a quick map

September 19, 2012
We (rOpenSci) started a repo to wrap the API for VertNet, an open access online database of vertebrate specimen records across many collection holders. Find the open source code here - please contribute if you are so inclined. We had a great Google Su...

## Divide or Mix. Flexible Approaches to Data Analysis

September 19, 2012
A very interesting paradigm in data analysis comes from the need to model data for which it is difficult to find a single global function capable of representing the data adequately. We could see a spectrum of models … Continue reading →

## Technical Analysis of Montana Voter Registration Files

September 18, 2012
By Earl F Glynn | Franklin Center. This article shows how to analyze Montana voter registration files using R. The Montana Secretary of State sells two files of voter registration records for a fee of \$1000: the Statewide Voter File and the Voter History File. The Montana Secretary of State provides information about the two files on this

## Automatic drug utilization reports with R and ggplot2

September 18, 2012
This program takes a data set of drug utilisation of 4 fictional drugs in 10 fictional hospitals and plots each time series with a locally weighted regression (Lowess) trend line. It also places a time-series trend of the usage for each … Continue reading →

## RcppArmadillo 0.3.4.1

September 18, 2012
Conrad released the bug-fix release 3.4.1 of Armadillo earlier today, and the corresponding RcppArmadillo package 0.3.4.1 is already on CRAN. No R-level or interface changes were made; the upstream changes are summarized below. Changes in RcppArmadillo...

## DESeq vs edgeR Comparison

September 18, 2012
Update (Dec 18, 2012): Please see this related post I wrote about differential isoform expression analysis with Cuffdiff 2. DESeq and edgeR are two methods and R packages for analyzing quantitative readouts (in the form of counts) from high-throughput e...

## Creating Posters using R, Latex, Beamer and Beamerposter

September 18, 2012
A while ago I needed to produce some posters that included lots of data (scientific style).  Having recently got back into R and started learning LaTeX, I googled for a way to do this using R.  Here's what I found and ended up with, using R, LaT...

## Copulas and tail dependence, part 3

September 18, 2012
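We have seen extreme value copulas in the section where we considered general families of copulas. In the bivariate case, an extreme value copula can be written $$C(u,v)=\exp\left[\log(uv)\,A\left(\frac{\log v}{\log(uv)}\right)\right]$$ where $$A(\cdot)$$ is the Pickands dependence function, which is a convex function satisfying $$\max\{1-\omega,\omega\}\leq A(\omega)\leq 1$$ Observe that in ...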

## Copulas and tail dependence, part 2

September 18, 2012
An alternative way to describe tail dependence can be found in Ledford & Tawn (1996), for instance. The intuition behind it can be found in Fischer & Klein (2007). Assume that $$X$$ and $$Y$$ have the same distribution. Now, if we assume that those vari...

## Designing Data Apps with R at Periscopic

September 18, 2012
Today's guest post comes to us from Andrew Winterman, Data Designer at data visualization company Periscopic. He shares with us the process of using the R language and other tools to create an interactive data application for a client -- ed. The Hewlett Foundation contacted us a few months ago because they were interested in exploring ways to visualize...

## R Graphics with ggplot2

September 18, 2012
ggplot2 is one of the most elegant R packages for data analysis and visualization.  Recently I gave a tutorial on the ggplot2 package.  You can find my ggplot2 notes here (click the image below). You can find my presentation slides below. The … Continue reading →

## Side note…

September 18, 2012
MathJax allows you to customize how $$\LaTeX$$ is displayed. Simply right-click on the math you’d like to see to access the display menu. Under “math settings” you can see zoom trigger and factor options. Given how small the text ...

## Embedding $$\LaTeX$$ in Tumblr

September 18, 2012
The classic Pythagorean identity is:  $$\sin^2(\theta) + \cos^2(\theta) = 1$$ The binomial formula, which calculates the probability of obtaining k tails when flipping a coin n times, with an assumed probability p for each trial, is: \( P(E)   = {n \choos...
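
As a rough illustration of the binomial formula mentioned above, here is a minimal Python sketch. It is not from the original post: the function name `binomial_pmf` is made up for this example, and it assumes the standard form \( \binom{n}{k} p^k (1-p)^{n-k} \) for the probability of exactly k tails in n independent flips, each with tail probability p.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k tails in n independent flips.

    p is the assumed probability of a tail on each flip.
    The name and signature are illustrative, not from the original post.
    """
    if not 0 <= k <= n:
        # Impossible outcomes (negative k, or more tails than flips)
        return 0.0
    # Number of ways to place k tails among n flips, times the
    # probability of any one such sequence
    return comb(n, k) * p**k * (1 - p) ** (n - k)


# Example: exactly 2 tails in 4 flips of a fair coin
print(binomial_pmf(2, 4, 0.5))  # 0.375
```

Summing `binomial_pmf(k, n, p)` over k from 0 to n should give 1 (up to floating-point error), which is a quick sanity check on the formula.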

## Getting data from figures in published papers

September 18, 2012
The problem: There are a lot of figures in published papers in the scholarly literature, like the one below, from (Attwood et al. 2012): At some point, a scientist wants to ask a question for which they can synthesize the knowledge on that question b...

## Using R in Insurance at GIRO 2012

September 17, 2012
Every year the UK’s general insurance actuarial community organises a big conference, which they call GIRO, short for General Insurance Research Organising committee. This year's conference is in Brussels from 18 - 21 September 2012. Despite the fac...

## Copulas and tail dependence, part 1

September 17, 2012
As mentioned in the course last week, Venter (2003) suggested nice functions to illustrate tail dependence (see also some slides used in Berlin a few years ago). Joe (1990)'s lambda: Joe (1990) suggested a (strong) tail dependence index. For lower t...

## Why are some things easier to forecast than others?

September 17, 2012
Forecasters are often met with skepticism. Almost every time I tell someone that I work in forecasting, they say something about forecasting the stock market, or forecasting the weather, usually suggesting that such forecasts are hopelessly inaccurate. In fact, forecasts of the weather are amazingly accurate given the complexity of the system, while anyone claiming to forecast the stock...

## Permanent Portfolio

September 17, 2012
First, just a quick update: I’m moving the release date of the SIT package a few months down the road, probably in November. Now back to the post. Recently I came across a series of interesting posts about the Permanent Portfolio at the GestaltU blog. Today I want to show you how to back-test the

## In search of large ice floes

September 17, 2012
In search of large ice floes.

## INLA functions (yet again)

September 17, 2012
This links back to previous posts here and here. Earlier today, I had a quick chat with Michela (by email, actually) on this topic. In particular, she was trying to use the function I've written to compute summaries from the posterior distrib...

## Start your new relationship with data together with Roger Peng and 30000 other students

September 17, 2012
A week from today (on September 24) Coursera, an education technology company committed to making education freely available to any person who seeks it, is launching their online course “Computing