2334 search results for "ggplot"

Alternative to Grouped Bar Charts in R

October 27, 2013
By
Alternative to Grouped Bar Charts in R

The #spiffy @dseverski gave me this posit the other day: Hey, @hrbrmstr, doughnut chart aside, how would you approach the first graph at http://t.co/zjHoHRVOeo? Bump chart? Trend line? Leave as is?— David F. Severski (@dseverski) October 25, 2013 and, I...

Read more »

Table as an Image in R

October 24, 2013
By
Table as an Image in R

Usually, it's best to keep tables as text, but if you're making a lot of graphics, it can be helpful to be able to create images of tables.PNG tableCreating the TableAfter loading the data, let's first use this trick to put line breaks between the leve...

Read more »

The Basics of Encoding Categorical Data for Predictive Models

October 23, 2013
By
The Basics of Encoding Categorical Data for Predictive Models

Thomas Yokota asked a very straight-forward question about encodings for categorical predictors: "Is it bad to feed it non-numerical data such as factors?" As usual, I will try to make my answer as complex as possible. (I've heard the old wives tale that eskimos have 180 different words in their language for snow. I'm starting to think that statisticians have...

Read more »

New R package: scholar

October 23, 2013
By
New R package: scholar

My new R package, scholar, has just been posted on CRAN. The scholar package provides functions to extract citation data from Google Scholar. In addition to retrieving basic information about a single scholar, the package also allows you to compare multiple scholars and predict future h-index values. There’s a full guide on Github (along

Read more »

Time series plots in R

October 23, 2013
By
Time series plots in R

I recently coauthored a couple of papers on trends in environmental data (Curtis & Simpson in press; Monteith et al. in press), which we estimated using GAMs. Both papers included plots like the one shown below wherein we show the estimated trend and associated point-wise 95% confidence interval, plus some other markings. The coloured sections show...

Read more »

Pre-calculating large tables of values

Pre-calculating large tables of values

Pre-calculating large tables of values I'm currently working on a project where we want to know, based on a euclidian distance measure, what is the probability that the value is a match to the another value. i.e. given an actual value, and a theoretical value from calculation, what is the probability that they are the same? This can be calculated...

Read more »

OA week – A simple use case for programmatic access to PLOS full text

October 21, 2013
By
OA week – A simple use case for programmatic access to PLOS full text

Open access week is here! We love open access, and think it's extremely important to publish in open access journals. One of the many benefits of open access literature is that we likely can use the text of articles in OA journals for many things, including text-mining. What's even more awesome is some OA publishers provide API (application...

Read more »

When did “How I Met Your Mother” become less legen.. wait for it…

October 21, 2013
By
When did “How I Met Your Mother” become less legen.. wait for it…

…dary!  Or, as you’ll see below, when did it become slightly less legendary?  The analysis in this post was inspired by DiffusePrioR’s analysis of when The Simpsons became less Cromulent. When I read his post a while back, I thought … Continue reading →

Read more »

Bar plot with error bars in R

October 20, 2013
By
Bar plot with error bars in R

Here's a simple way to make a bar plot with error bars three ways: standard deviation, standard error of the mean, and a 95% confidence interval. The key step is to precalculate the statistics for ggplot2. Continue reading →

Read more »

Plotting Times of Discrete Events

October 19, 2013
By
Plotting Times of Discrete Events

I recently enjoyed reading O’Hara, R. B., & Kotze, D. J. (2010). Do not log-transform count data. Methods in Ecology and Evolution, 1(2), 118–122. doi:10.1111/j.2041-210X.2010.00021.x. The article prompted me to think about processes involving discrete events and how these might be presented graphically. I am not talking about counts (which are well represented by a

Read more »