Part 3b: EDA with ggplot2

March 16, 2015 | Pedro M.

In Part 3a I introduced the plotting system ggplot2. I talked about its concepts and syntax in some detail, and then created a few general plots using the weather data set we've been working with in this series of tutorials. My goal was to show ... [Read more...]

Part 3a: Plotting with ggplot2

March 4, 2015 | Pedro M.

We will start off this first section of Part 3 with a brief introduction to the plotting system ggplot2. Then, focusing mainly on the syntax, we will create a few graphs based on the weather data we prepared previously. Next, in Part 3b, where we will be ... [Read more...]

Part 2: Data Preparation

February 20, 2015 | Pedro M.

In Part 1 I introduced the weather data set we will be using in this series of tutorials. We are now going to prepare the data for the subsequent EDA. We will recode and transform variables, change their types, and perform some basic data ch... [Read more...]

Part 1: Introduction

February 14, 2015 | Pedro M.

The ultimate goal of every data scientist is to extract as much valuable information as possible from a given data set. We want to be able to predict the future based on the past, to discover deep, hidden patterns in the data, and to expand the current base ... [Read more...]

