227 search results for "boxplot"

Boxplots and Beyond – Part II: Asymmetry

In my last post, I discussed boxplots in their simplest forms, illustrating some of the useful options available with the boxplot command in the open-source statistical software package R.  As I noted in that post, the basic boxplot is both useful...

Clustering NHL Skaters

February 6, 2011

I have been sitting on this post for some time now and wanted to get it out there.  The goal is simply to show how easy it is to pull live data from the web into R, massage it, and perform some analytics on it.  I am not sure how useful this analysis really is...

Boxplots and Beyond – Part I

Boxplots are a simple and reasonably popular way of summarizing the range of variation of a real-valued variable across different subsets of data.  Typical examples might include diastolic blood pressure across a group of patients, broken dow...

How to label all the outliers in a boxplot

January 27, 2011

In this post I offer an alternative function for boxplot, which will enable you to label outlier observations while handling complex uses of boxplot.

sab-R-metrics: Intermediate Scatter Plots

January 25, 2011

First off, I'll say it's been a whirlwind of a past few days. Thanks to David Smith at the Revolutions Blog for his kind words about the sab-R-metrics series and link back this way. Add in Ed Kupfer's posts at the APBRmetrics board, Harry Pavlidis at THT, Dave Allen at Fangraphs...

The Art of Exploratory Data Analysis

This blog is about the art of exploratory data analysis, which is also the subject of my new book, Exploring Data in Engineering, the Sciences, and Medicine (http://www.oup.com/us/ExploringData).  This art is appropriate in situations where y...

Hard drive occupation prediction with R – part 2 – Getting the probability distribution

In the first article, we saw a quick-and-dirty method to predict disk space exhaustion when the usage pattern is strictly linear. We did that by importing our data into R and fitting a linear regression. In this article we will see the problems wit...

Hard drive occupation prediction with R – part 2

In the first article, we saw a quick-and-dirty method to predict disk space exhaustion when the usage pattern is strictly linear. We did that by importing our data into R and fitting a linear regression. In this article we will see the problems wit...

Learning R through baseball: sab-R-metrics

January 21, 2011

The words "statistics" and "baseball" are often found near each other, but there's a lot more to statistics than dividing the number of hits by the number of swings to get a batting average. And there's a lot more to sabermetrics -- the statistical analysis of baseball -- than averages, too. Many baseball fans are also stats geeks (and...