335 search results for "boxplot"

Labeled outliers in R boxplot

Labeled outliers in R boxplot

Boxplots are a good way to get some insight in your data, and while R provides a fine ‘boxplot’ function, it doesn’t label the outliers in the graph. However, with a little code you can add labels yourself:The numbers plotted next to ...

Read more »

My Commonly Done ggplot2 graphs: Part 2

December 18, 2014
By
My Commonly Done ggplot2 graphs: Part 2

In my last post I described some of my commonly done ggplot2 graphs. It seems as though some people are interested in these, so I was going to follow this up with other plots I make frequently. Scatterplot colored by continuous variable The setup of the data for the scatterplots will be the same as

Read more »

Study of a plot

December 3, 2014
By
Study of a plot

I began to think on a nice way of plotting campaign expenditures in a paper I'm working on. I thought this would be something like the following--simple but meaningful even when there are outliers in both tails. Though I like the seniors Tukey's boxplot and scatter plots, I had already used them the last time … Read More...

Read more »

Visualization of probabilistic forecasts

November 21, 2014
By
Visualization of probabilistic forecasts

This week my research group discussed Adrian Raftery’s recent paper on “Use and Communication of Probabilistic Forecasts” which provides a fascinating but brief survey of some of his work on modelling and communicating uncertain futures. Coincidentally, today I was also sent a copy of David Spiegelhalter’s paper on “Visualizing Uncertainty About the Future”. Both are

Read more »

Bioinformatics journals: time from submission to acceptance, revisited

October 13, 2014
By
Bioinformatics journals: time from submission to acceptance, revisited

Before we start: yes, we’ve been here before. There was the Biostars question “Calculating Time From Submission To Publication / Degree Of Burden In Submitting A Paper.” That gave rise to Pierre’s excellent blog post and code + data on Figshare. So why are we here again? 1. It’s been a couple of years. 2.

Read more »

ggvis 0.4

October 13, 2014
By
ggvis 0.4

ggvis 0.4 is now available on CRAN. You can install it with: install.packages("ggvis") The major features of this release are: Boxplots, with layer_boxplots() chickwts %>% ggvis(~feed, ~weight) %>% layer_boxplots() Better stability when errors occur. Better handling of empty data and malformed data. More consistent handling of data in compute pipeline functions. Because of these changes,

Read more »

Multiple Tests, an Introduction

September 24, 2014
By
Multiple Tests, an Introduction

Last week, a student asked me about multiple tests. More precisely, she ran an experience over – say – 20 weeks, with the same cohort of – say – 100 patients. An we observe some size=100 nb=20 set.seed(1) X=matrix(rnorm(size*nb),size,nb) (here, I just generate some fake data). I can visualize some trajectories, over the 20 weeks, library(RColorBrewer) cl1=brewer.pal(12,"Set3") cl2=brewer.pal(8,"Set2") cl=c(cl1,cl2)...

Read more »

“Do You Want to Steal a Snowman?” – A Look (with R) At TorrentFreak’s Top 10 PiRated Movies List #TLAPD

September 18, 2014
By
“Do You Want to Steal a Snowman?” – A Look (with R) At TorrentFreak’s Top 10 PiRated Movies List #TLAPD

We leave the Jolly Roger behind this year and turn our piRate spyglass towards the digital seas and take a look at piRated movies as seen through the lens of TorrentFreak. The seasoned seadogs who pilot that ship have been doing a weekly “Top 10 Pirated Movies of the Week” post since early 2013, and...

Read more »

Comparing machine learning models in R

September 18, 2014
By
Comparing machine learning models in R

by Joseph Rickert While preparing for the DataWeek R Bootcamp that I conducted this week I came across the following gem. This code, based directly on a Max Kuhn presentation of a couple years back, compares the efficacy of two machine learning models on a training data set. #----------------------------------------- # SET UP THE PARAMETER SPACE SEARCH GRID ctrl <-...

Read more »

R version of “An exploratory technique for visualizing the distributions of 100 variables:”

September 10, 2014
By
R version of “An exploratory technique for visualizing the distributions of 100 variables:”

Rick Wicklin (@RickWicklin) made a recent post to the SAS blog on An exploratory technique for visualizing the distributions of 100 variables. It’s a very succinct tutorial on both the power of boxplots and how to make them in SAS (of course). I’m not one to let R be “out-boxed”, so I threw together a

Read more »