## Exploratory data analysis techniques

March 3, 2014
In my previous blog post I explained the steps needed to solve a data analysis problem. Going further, I will discuss each step of data analysis in detail. In this post, we discuss exploratory analysis. What is exploratory analysis? “Understanding data visually.” Exploratory analysis means analyzing datasets to summarize their main characteristics,...

## Plotting an Odd number of plots in single image

March 3, 2014
Sometimes I need to reduce the number of images for a presentation or an article. A good way of doing this is to put multiple plots in the same tif or jpg file. R has multiple functions to achieve this objective and a nice tutorial for this topic...

## rCharts with slidy

March 3, 2014
In my last post I talked about using rCharts to create interactive graphics for my interview presentations. They seemed to go over pretty well in my interviews and helped me greatly, as I did not need to remember or write down specific numbers to talk about. I use slidy to create my HTML slideshows and there was some...

## Oldies but Goldies: Statistical Graphics Books

March 1, 2014
I just wanted to plug three classic books on statistical graphics that I really enjoyed reading. The books are old (that is, older than me) but still relevant, and together they give a sense of the development of exploratory graphics in general and the graphics system in R specifically, as all three books were written at Bell Labs...

## Bayesian First Aid: Two Sample t-test

February 24, 2014
As spring follows winter once more down here in southern Sweden, the two sample t-test follows the one sample t-test. This is a continuation of the Bayesian First Aid alternative to the one sample t-test, where I’ll introduce the two sample alternative. It will be quite a short post as the two sample alternative is just more of...

## Quick and dirty notes on General Linear Mix Models

February 24, 2014
My datasets tend to have random factors. I try to stick to general models whenever I can to avoid dealing with both random factors and complex error distributions (not always possible). I am compiling some notes here to avoid visiting …

## ggplot2: Cheatsheet for Visualizing Distributions

February 18, 2014
In the third and last of the ggplot series, this post will go over interesting ways to visualize the distribution of your data.

## Conditioning and Grouping with Lattice Graphics

February 17, 2014
Conditioning and grouping are two important concepts in graphing that allow us to rapidly refine our understanding of the data under consideration. Conditioning, in particular, allows us to view relationships across “panels” with common scales. Each panel contains a plot whose data is “conditional” upon records drawn from the category that supports that particular panel (an...

## Bayesian First Aid: One Sample and Paired Samples t-test

February 4, 2014
Student’s t-test is a staple of statistical analysis. A quick search on Google Scholar for “t-test” results in 170,000 hits in 2013 alone. In comparison, “Bayesian” gives 130,000 hits while “box plot” results in only 12,500 hits. To be honest, if I had to choose I would most of the time prefer a notched boxplot to...