2065 search results for "ggplot"

Carl Morris Symposium on Large-Scale Data Inference (2/3)

October 20, 2012
By
Carl Morris Symposium on Large-Scale Data Inference (2/3)

Continuing the summary of last week’s symposium on statistics and data visualization (see part 1 and part 3)… Here I describe Dianne Cook’s discussion of visual inference, and Rob Kass’ talk on statistics in cognitive neuroscience. [Edit: I've added a few … Continue reading →

Read more »

The rapidly increasing ideology of the US Republican Party

October 18, 2012
By
The rapidly increasing ideology of the US Republican Party

The chart below comes by way of the is.R blog and shows the average ideology of the members of the United State House of Representatives within the Republican (red) and Democratic (blue) parties. (Other parties are shown in green.) The chart is shown as a time series, from the first US congress in 1789, to the most recent full...

Read more »

Benchmarking distance calculation in R

October 18, 2012
By
Benchmarking distance calculation in R

A typical step in a lot of data mining methods is the calculation of a distance between entities. For example using the nearest-neighbor method it is crucial to do this calculation very efficiently because it is the most time-consuming step of the procedure. Just imagine you want to compute the Euclidean distance between a constantly changing database...

Read more »

Faceting as a preferable alternative to 3-D

October 18, 2012
By
Faceting as a preferable alternative to 3-D

Sometimes, people want to plot things in three dimensions. Others have spoken more eloquently than I could on the potential problems with plotting multiple two-dimensional relationships in a two-dimensional medium with an artificial three-dimensional ...

Read more »

Basic ideas on aggregate, plyr and crosstables!

October 17, 2012
By
Basic ideas on aggregate, plyr and crosstables!

A common task using R is the investigation of one particular dataset. Usually we have a mixture of numerical and categorial data and are interested in some statistics (e.g. means and so on). And there are a lot of threads, blogs etc around that. Sorry for adding another one, but so I remember myself. Let’s

Read more »

9 reasons to use RStudio

October 16, 2012
By
9 reasons to use RStudio

In no particular order, here are nine reasons why I really like the RStudio IDE for the R statistical programming language. 1) R benefits from an IDE – I accept that in some languages an IDE is unnecessary—Perl is the first example … Continue reading →

Read more »

Using consistent R and LaTeX fonts in Org (or knitr, or Sweave)

October 15, 2012
By
Using consistent R and LaTeX fonts in Org (or knitr, or Sweave)

I love good typography, even more so as Microsoft Word and PowerPoint have debased our standards. When I see a really fine piece of technical typesetting, it’s almost always done using TeX and friends. Beautiful LaTeX documents are easy to … Continue reading →

Read more »

How do I re-arrange…?: Ordering a plot.

October 15, 2012
By
How do I re-arrange…?: Ordering a plot.

One of the most widely seen FAQ coming across list serves and R help sites is the question: “How do I re-arrange/re-order (plotting geom/aesthetic such as bar/labels) in a (insert plot type here) using(insert graphics system here) in R?” . … Continue reading →

Read more »

Vice Presidential Debates with qdap-beta

October 13, 2012
By
Vice Presidential Debates with qdap-beta

After the presidential debates I used the beta version of qdap to provide some initial surface level analysis (LINK to Presidential Debates with qdap-beta). In the comments of that post, annon (a commenter) provided a link to an analysis/visualization that … Continue reading →

Read more »

Minute by Minute Twitter Sentiment Timeline from the VP debate

October 12, 2012
By
Minute by Minute Twitter Sentiment Timeline from the VP debate

Click on above graph to enlarge. Background The data for this graph was collected automatically every ~60 seconds of the VP debate on 10/11/2012, with an ending aggregate sample size of 363,163 tweets.  From this dataset duplicate tweets were removed (because of bots), which gave a final dataset of 81,124 remaining unique tweets (52,303-Biden, 28,821-Ryan).

Read more »