2062 search results for "ggplot2"

xkcd Style Bubble Plot

May 23, 2013

A package was recently released to generate plots in the style of xkcd using R. Being a big fan of the cartoon, I could not resist trying it out. So I set out to produce something like one of Hans Rosling’s bubble plots. First I needed some data. Spoilt for choice. I scraped some population data broken

Read more »

My Prime Sieve – Homage to Yitang Zhang

May 22, 2013

As a homage to Yitang Zhang, who has proven a mind-bending property of prime pairs, I have written a prime sieve to detect all of the prime numbers from 1 to N. There might very well be a function in the base package that already does this. No...

Read more »
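
The excerpt does not show the post's own R code, so the following is only a minimal sketch of one standard way to find all primes up to N, the Sieve of Eratosthenes. It is written in Python rather than R, and the function name `primes_up_to` is a hypothetical choice, not taken from the post.

```python
def primes_up_to(n):
    """Return all primes p with 2 <= p <= n (Sieve of Eratosthenes)."""
    if n < 2:
        return []
    # is_prime[k] stays True until k is crossed out as a multiple of a smaller prime.
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= n:
        if is_prime[p]:
            # Start at p*p: smaller multiples were already crossed out.
            for multiple in range(p * p, n + 1, p):
                is_prime[multiple] = False
        p += 1
    return [k for k, flag in enumerate(is_prime) if flag]


print(primes_up_to(30))  # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```

The sieve trades memory (one flag per integer up to N) for speed, which is usually a reasonable choice for moderate N.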

Analyzing a simple experiment with heterogeneous variances using asreml, MCMCglmm and SAS

May 17, 2013

I was working with a small experiment which includes families from two Eucalyptus species and thought it would be nice to code a first analysis using alternative approaches. The experiment is a randomized complete block design, with species as a fixed effect and family and block as random effects, while the response variable is growth

Read more »

1.5 percent of doctors, a quarter of malpractice reports

May 14, 2013

Some doctors receive more malpractice reports than others. Just how unequal is the distribution of malpractice reports? The post "1.5 percent of doctors, a quarter of malpractice reports" appeared first on Decision Science News.

Read more »

SIR Model – The Flu Season – Dynamic Programming

May 14, 2013

The SIR (susceptible, infected, and recovered) model is a common and useful tool in epidemiological modelling. In this post and in future posts I hope to explore how this basic model can be enriched by including different population group...

Read more »
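
For readers unfamiliar with the basic SIR model, here is a minimal discrete-time sketch in Python (not the post's R code, which the excerpt does not show). The function name `simulate_sir`, the daily time step, and the example values of `beta` (transmission rate) and `gamma` (recovery rate, assumed to be at most 1) are all illustrative assumptions.

```python
def simulate_sir(s0, i0, r0, beta, gamma, days):
    """Step a basic SIR model forward one day at a time.

    beta  : assumed transmission rate per day (hypothetical value in the example)
    gamma : assumed recovery rate per day, expected to satisfy 0 <= gamma <= 1
    Returns a list of (S, I, R) tuples, one per day including day 0.
    """
    n = s0 + i0 + r0
    s, i, r = float(s0), float(i0), float(r0)
    history = [(s, i, r)]
    for _ in range(days):
        # New infections are proportional to contacts between S and I.
        new_infections = min(beta * s * i / n, s)
        new_recoveries = gamma * i
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        history.append((s, i, r))
    return history


# Example run with made-up parameters: 1000 people, 10 initially infected.
trajectory = simulate_sir(990, 10, 0, beta=0.3, gamma=0.1, days=100)
peak_day = max(range(len(trajectory)), key=lambda d: trajectory[d][1])
print("Day of peak infections:", peak_day)
```

Because people only move between compartments, S + I + R stays equal to the starting population at every step, which is a handy sanity check.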

Visualizing your websites’ ecommerce performance with R

May 14, 2013

In this blog post, I want to dive deeper into the relationship between Frequency and Recency of Visits and the Conversion Rate and Average Order Value. I have used the RGA package for data extraction and Dr. Hadley Wickham’s ggplot2 package to achieve the visualizations. Here’s the data aggregation script: #transactions dataframe

Read more »

Integration take two – Shiny application

May 13, 2013

My last post discussed a technique for integrating functions in R using a Monte Carlo or randomization approach. The mc.int function (available here) estimated the area underneath a curve by multiplying the proportion of random points below the curve by the total area covered by points within the interval: The estimated integration (bottom plot) is

Read more »
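
The estimate described in the excerpt (proportion of random points below the curve, multiplied by the total area covered by the points) can be sketched as follows. This is a Python illustration, not the post's R `mc.int` function; the name `mc_int`, its parameters, and the requirement that the function be non-negative and bounded above by `y_max` on the interval are assumptions made for this sketch.

```python
import random


def mc_int(f, a, b, y_max, n=100_000, seed=None):
    """Hit-or-miss Monte Carlo estimate of the integral of f over [a, b].

    Assumes 0 <= f(x) <= y_max on [a, b] (an assumption of this sketch).
    Points are drawn uniformly in the rectangle [a, b] x [0, y_max].
    """
    rng = random.Random(seed)
    below = 0
    for _ in range(n):
        x = rng.uniform(a, b)
        y = rng.uniform(0.0, y_max)
        if y <= f(x):
            below += 1
    # Proportion of points under the curve times the rectangle's area.
    return (below / n) * (b - a) * y_max


# Example: the integral of x^2 over [0, 1] is exactly 1/3.
estimate = mc_int(lambda x: x * x, 0.0, 1.0, y_max=1.0, seed=1)
print(estimate)
```

The error of this kind of estimate shrinks roughly with the square root of the number of points, so more points give a closer result only slowly.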

Global Indicator Analyses with R

May 13, 2013

I was recently asked by a client to create a large number of “proof of concept” visualizations that illustrated the power of R for compiling and analyzing disparate datasets. The client was specifically interested in automated analyses of global data. A little research led me to the WDI package. The WDI package is a tool...

Read more »

The Guerilla Guide to R

May 12, 2013

Update: Okay. I've uploaded a new template and things seem to be fine now. Update: I am aware the table of contents is not being displayed in bullet form as I intended. The web template I'm using seems to be buggy. It also seems to think this page is in Indonesian... Working on it! Table of Contents: Reading/Writing Files

Read more »

Veterinary Epidemiologic Research: Count and Rate Data – Poisson Regression and Risk Ratios

May 10, 2013

As noted in paragraph 18.4.1 of the book Veterinary Epidemiologic Research, logistic regression is widely used for binary data, with the estimates reported as odds ratios (OR). While this is appropriate for case-control studies, risk ratios (RR) are preferred for cohort studies, as RR provide estimates of probabilities directly. Moreover, it is often forgotten the assumption

Read more »
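
To make the OR versus RR distinction in the excerpt concrete, here is a small worked sketch in Python. It computes both measures directly from a 2x2 table and does not reproduce the post's Poisson regression; the function names and the cohort counts are made up for illustration.

```python
def risk_ratio(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Ratio of the risk (probability of disease) in exposed vs unexposed."""
    risk_exposed = exposed_cases / exposed_total
    risk_unexposed = unexposed_cases / unexposed_total
    return risk_exposed / risk_unexposed


def odds_ratio(exposed_cases, exposed_total, unexposed_cases, unexposed_total):
    """Ratio of the odds of disease in exposed vs unexposed."""
    exposed_noncases = exposed_total - exposed_cases
    unexposed_noncases = unexposed_total - unexposed_cases
    return (exposed_cases * unexposed_noncases) / (exposed_noncases * unexposed_cases)


# Hypothetical cohort: 40 of 100 exposed animals and 20 of 100 unexposed animals are cases.
rr = risk_ratio(40, 100, 20, 100)
odds = odds_ratio(40, 100, 20, 100)
print(f"RR = {rr:.2f}, OR = {odds:.2f}")
```

With these made-up counts the odds ratio is noticeably larger than the risk ratio, which illustrates why the two should not be read interchangeably when the outcome is common.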