Exploratory Data Analysis

Plotting Time Series data using ggplot2

September 30, 2010 | Ralph

There are various ways to plot data that is represented by a time series in R. The ggplot2 package has scales that can handle dates reasonably easily. Fast Tube by Casper As an example consider a data set on the number of views of the you tube channel ramstatvid. A ...
[Read more...]

Charting the performance of cricket all-rounders – IT Botham

August 16, 2010 | Ralph

Cricket is a sport that generates a large volume of performance data and corresponding debate about the relative qualities of various players over their careers and in relation to their contemporaries. The cricinfo website has an extensive database of statistics for professional cricketers that can be searched to access the ... [Read more...]

Displaying data using level plots

May 3, 2010 | Ralph

A level plot is a type of graph that is used to display a surface in two rather than three dimensions – the surface is viewed from above as if we were looking straight down and is an alternative to a contour plot – geographic data is an example of where this ...
[Read more...]

Summarising data using box and whisker plots

April 25, 2010 | Ralph

A box and whisker plot is a type of graphical display that can be used to summarise a set of data based on the five number summary of this data. The summary statistics used to create a box and whisker plot are the median of the data, the lower and ...
[Read more...]

R and Tolerance Intervals

April 19, 2010 | Ralph

Confidence intervals and prediction intervals are used by statisticians on a regular basis. Another useful interval is the tolerance interval that describes the range of values for a distribution with confidence limits calculated to a particular percentile of the distribution. The R package tolerance can be used to create a ... [Read more...]

Summarising data using scatter plots

April 18, 2010 | Ralph

A scatter plot is a graph used to investigate the relationship between two variables in a data set. The x and y axes are used for the values of the two variables and a symbol on the graph represents the combination for each pair of values in the data set. ...
[Read more...]

Summarising data using histograms

April 11, 2010 | Ralph

The histogram is a standard type of graphic used to summarise univariate data where the range of values in the data set is divided into regions and a bar (usually vertical) is plotted in each of these regions with height proportional to the frequency of observations in that region. In ... [Read more...]

Summarising data using dot plots

March 26, 2010 | Ralph

A dot plot is a type of display that compares counts, frequencies, totals or other summary measures for a series of categories. The dot plot can be arranged with the categories either on the vertical or horizontal axis of the display to allow comparising between the different categories as well ... [Read more...]

Eight R Video Tutorials on VCASMO

February 4, 2010 | Ed Borasky

Download "Getting Started with the Social Media Analytics Research Toolkit" (pdf, 1.25 megabytes) Download the Social Media Analytics Research Toolkit Thanks to Drew Conway (@drewconway), a PhD student at New York University, there are now eight excell... [Read more...]

Three Must-Have Books on Data Visualization

February 3, 2010 | Ed Borasky

Download "Getting Started with the Social Media Analytics Research Toolkit" (pdf, 1.25 megabytes) Download the Social Media Analytics Research Toolkit Disclosure As you probably know, I live in the Portland, Oregon area and have for many years. One of ... [Read more...]

Summarising data using bar charts

December 12, 2009 | Ralph

A bar graph is a frequently used type of display that compares counts, frequencies, totals or other summary measures for a series of categories, e.g. sales in different market sectors or in quarters in a financial year. The bar graph can be laid out with the categories either on ... [Read more...]

Creating scatter plots using ggplot2

November 6, 2009 | Ralph

The ggplot2 package can be used as an alternative to lattice for producing high quality graphics in R. The package provides a framework and hopefully simple interface to producing graphs and is inspired by the grammar of graphics. The main function for producing graphs in this package is qplot, which ... [Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)