Articles by Hasan Imtiaz

Data Hacking with RDSTK 3

February 16, 2017 | Hasan Imtiaz

RDSTK is a very versatile package. It includes functions to help you convert IP address to geo locations and derive statistics from them. It also allows you to input a body of text and convert it into sentiments. This is a continuation from the last exercise RDSTK 2 We are going ...
[Read more...]

Data Hacking with RDSTK 2

February 11, 2017 | Hasan Imtiaz

RDSTK is a very versatile package. It includes functions to help you convert IP address to geo locations and derive statistics from them. It also allows you to input a body of text and convert it into sentiments. This is a continuation from the last exercise RDSTK 1 This package provides ...
[Read more...]

Data Hacking with RDSTK (part 1)

January 31, 2017 | Hasan Imtiaz

RDSTK is a very versatile package. It includes functions to help you convert IP address to geo locations and derive statistics from them. It also allows you to input a body of text and convert it into sentiments. This package provides an R interface to Pete Warden’s Data Science ...
[Read more...]

Let’s get started with dplyr

January 12, 2017 | Hasan Imtiaz

The dplyr package by Hadley Wickham is a very useful package that provides “A Grammar of Data Manipulation”. It aims to simplify common data manipulation tasks, and provides “verbs”, i.e. functions that correspond to the most common data manipulation tasks. Have fun playing with dplyr in the exercises below! ... [Read more...]

Intermediate Tree 2

January 5, 2017 | Hasan Imtiaz

This is a continuation of the intermediate decision tree exercise. Answers to the exercises are available here. If you obtained a different (correct) answer than those listed on the solutions page, please feel free to post your answer as a comment on that page. Exercise 1 use the predict() command to ... [Read more...]

Intermediate Tree 1

December 29, 2016 | Hasan Imtiaz

If you followed through the Basic Decision Tree exercise, this should be useful for you. This is like a continuation but we add so much more. We are working with a bigger and badder datasets. We will be also using techniques we learned from model evaluation and work with ROC, ... [Read more...]

Model Evaluation 2

December 22, 2016 | Hasan Imtiaz

We are committed to bringing you 100% authentic exercise sets. We even try to include as different datasets as possible to give you an understanding of different problems. No more classifying Titanic dataset. R has tons of datasets in its library. This is to encourage you to try as many datasets ... [Read more...]

Basic Tree 2 Exercises

December 15, 2016 | Hasan Imtiaz

This is a continuation of the exercise Basic Tree 1 Answers to the exercises are available here. If you obtained a different (correct) answer than those listed on the solutions page, please feel free to post your answer as a comment on that page. Exercise 1 load the tree library. If it ... [Read more...]

Basic Tree 1 Exercises

December 9, 2016 | Hasan Imtiaz

Using the knowledge you acquired in the previous exercises on sampling and selecting(here), we will now go through an entire data analysis process. You will be using what you know as crutches to solve the problems. Don’t worry. It might look intimidating but follow the sequence and you ... [Read more...]

Select and Query Exercise

December 6, 2016 | Hasan Imtiaz

In this exercise we cover the basics on selecting and extracting data using queries. We add a few other materials into it. This should prepare you for the next exercise: Basic Decision Tree. The purpose of this is to give you the 20 percent of the tools to get 80 percent of ... [Read more...]

Model Evaluation Exercises 1

December 2, 2016 | Hasan Imtiaz

We are committed to bringing you 100% authentic exercise sets. We even try to include as different datasets as possible to give you an understanding of different problems. No more classifying Titanic dataset. R has tons of datasets in its library. This is to encourage you to try as many datasets ... [Read more...]

Frequency and chi-square test for independence Exercises

November 30, 2016 | Hasan Imtiaz

In this exercise, we cover some basics on frequency tables. We also briefly look at chi-square test for independence to find relationships of two variables. Before proceeding, it might be helpful to look over the help pages for the table, summary, and margin.table functions. Answers to the exercises are ... [Read more...]

Sampling Exercise Part 1

November 13, 2016 | Hasan Imtiaz

In this Exercise, we will dive quickly through some basic sampling methods. Follow along this series to use these methods later for our decision trees modelling exercise. We will sample using the package caTools and caret. This is a beginner level exercise. Please refer to the help section for set.... [Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)