2425 search results for "ggplot"

Starting Analysis and Visualisation of Spatial Data with R

April 8, 2013

Last week I ran an introductory workshop on the analysi

Read more »

Dirichlet Process, Infinite Mixture Models, and Clustering

April 7, 2013

The Dirichlet process provides a very interesting approach to understanding group assignments and models for clustering effects. Oftentimes we encounter the k-means approach. However, it requires a fixed number of clusters, and we often encounter situations where we don’t know how many clusters we need. Suppose we’re trying to identify

Read more »

Sync

April 7, 2013

I am listening to the audiobook Sync: How Order Emerges from Chaos in the Universe, Nature, and Daily Life by Steven Strogatz, which I got from Audible. Obviously a mathematical book is not ideal to listen to, but lacking illustrations I can ma...

Read more »

Mortality after paediatric heart surgery using public domain data

April 6, 2013

This post comes with some big health warnings. The recent events in Leeds highlight the difficulties faced in judging the results of surgery by individual hospital. A clear requirement is timely access to data in a form easily digestible by the public. Here I’ve scraped the publicly available data from the central cardiac audit database

Read more »

Worry about correctness and repeatability, not p-values

April 5, 2013

In data science work you often run into cryptic sentences like the following: Age adjusted death rates per 10,000 person years across incremental thirds of muscular strength were 38.9, 25.9, and 26.6 for all causes; 12.1, 7.6, and 6.6 for cardiovascular disease; and 6.1, 4.9, and 4.2 for cancer (all P < 0.01 for linear

Read more »

Multiple pairwise comparisons for categorical predictors

April 5, 2013

Dale Barr (@datacmdr) recently had a nice blog post about coding categorical predictors, which reminded me to share my thoughts about multiple pairwise comparisons for categorical predictors in growth curve analysis. As Dale pointed out in his post, the R default is to treat the reference level of a factor as a...

Read more »

Estimated Follower Accession Charts for Twitter

April 5, 2013

Just over a year or so ago, Mat Morrison/@mediaczar introduced me to a visualisation he’d been working on (How should Page Admins deal with Flame Wars?) that I started to refer to as an accession chart (Visualising Activity Around a Twitter Hashtag or Search Term Using R). The idea is that we provide each entrant

Read more »

d3 Lifeline from vega and clickme

April 4, 2013

This has been an exciting week for d3.js and R, with the release of vega by the data vis powerhouses at Trifacta; the launch of clickme and its already significant rewrite to accommodate vega; and the inception of a very promising set of d3 templates, DexCharts, described in multiple posts. I am glad to have had...

Read more »

Announcing eeptools 0.2

April 4, 2013

My R package eeptools has reached version 0.2. As with the last release, this is still a preliminary release which means that functionality is not full, function names and code behavior may still change from version to version, and I am still looking f...

Read more »

R Beginners – Plotting Locations on to a World Map

April 2, 2013

This post is targeted at those who are just getting started plotting on maps using R. The relevant libraries are: maps, ggplot2, ggmap, and maptools. Make sure you install them. The Problem: Let's take a fairly simple use case: We have a few points on the...

Read more »