1796 search results for "ggplot2"

Distribution of colors by flag

October 22, 2012
By
Distribution of colors by flag

A story: We showed you how to use R to assess flag similarity and make a scatter plot of raster images. Dr. Wickham referred us to the set of 2400 flag icons made available by GoSquared, and then (probably jokingly) challenged us to replicate the cool...

Read more »

Basics of JavaScript and D3 for R Users

October 21, 2012
By
Basics of JavaScript and D3 for R Users

Hadley Wickham, creator of the ggplot2 R package, has been learning JavaScript and its D3 library for the next iteration of ggplot2 (tentatively titled r2d3?)… so I suspect it’s only a matter of time before he pulls the rest of the … Continue reading →

Read more »

Player timelines with ggplot

October 21, 2012
By
Player timelines with ggplot

Timelines can be quite a handy way of getting an overview of a player’s career in terms of when they played, with which team and who were their contemporaries As often is the case, I turned to Stackoverflow to set me on my way for an R solution. In this instance, I did not take

Read more »

Carl Morris Symposium on Large-Scale Data Inference (2/3)

October 20, 2012
By
Carl Morris Symposium on Large-Scale Data Inference (2/3)

Continuing the summary of last week’s symposium on statistics and data visualization (see part 1 and part 3)… Here I describe Dianne Cook’s discussion of visual inference, and Rob Kass’ talk on statistics in cognitive neuroscience. [Edit: I've added a few … Continue reading →

Read more »

Adding a background to your ggplot

October 19, 2012
By
Adding a background to your ggplot

I really enjoy using the DW-NOMINATE data for examples, as I do here. Sometimes it’s useful to indicate regions in the background of a plot — perhaps two-dimensional regions of interest, perhaps one-dimensional periods in time. It’s...

Read more »

Benchmarking distance calculation in R

October 18, 2012
By
Benchmarking distance calculation in R

A typical step in a lot of data mining methods is the calculation of a distance between entities. For example using the nearest-neighbor method it is crucial to do this calculation very efficiently because it is the most time-consuming step of the procedure. Just imagine you want to compute the Euclidean distance between a constantly changing database...

Read more »

Faceting as a preferable alternative to 3-D

October 18, 2012
By
Faceting as a preferable alternative to 3-D

Sometimes, people want to plot things in three dimensions. Others have spoken more eloquently than I could on the potential problems with plotting multiple two-dimensional relationships in a two-dimensional medium with an artificial three-dimensional ...

Read more »

Basic ideas on aggregate, plyr and crosstables!

October 17, 2012
By
Basic ideas on aggregate, plyr and crosstables!

A common task using R is the investigation of one particular dataset. Usually we have a mixture of numerical and categorial data and are interested in some statistics (e.g. means and so on). And there are a lot of threads, blogs etc around that. Sorry for adding another one, but so I remember myself. Let’s

Read more »

9 reasons to use RStudio

October 16, 2012
By
9 reasons to use RStudio

In no particular order, here are nine reasons why I really like the RStudio IDE for the R statistical programming language. 1) R benefits from an IDE – I accept that in some languages an IDE is unnecessary—Perl is the first example … Continue reading →

Read more »

Using consistent R and LaTeX fonts in Org (or knitr, or Sweave)

October 15, 2012
By
Using consistent R and LaTeX fonts in Org (or knitr, or Sweave)

I love good typography, even more so as Microsoft Word and PowerPoint have debased our standards. When I see a really fine piece of technical typesetting, it’s almost always done using TeX and friends. Beautiful LaTeX documents are easy to … Continue reading →

Read more »