Blog Archives

GoodReads: Machine Learning (Part 3)

September 30, 2016
By
GoodReads: Machine Learning (Part 3)

In the first installment of this series, we scraped reviews from Goodreads. In the second one, we performed exploratory data analysis and created new variables. We are now ready for the “main dish”: machine learning! Setup and general data prep Let’s start by loading the libraries and our dataset. library(data.table) library(dplyr) library(caret) library(RTextTools) library(xgboost) library(ROCR) Related PostMachine Learning for...

Read more »

GoodReads: Exploratory data analysis and sentiment analysis (Part 2)

September 14, 2016
By
GoodReads: Exploratory data analysis and sentiment analysis (Part 2)

After scraping reviews from Goodreads in the first installment of this series, we are now ready to do some exploratory data analysis to get a better sense of the data we have. This will also allow us to create features that we will use in future analyses. Setup and data preparation We start by loading Related PostGoodReads: Webscraping and...

Read more »

GoodReads: Webscraping and Text Analysis with R (Part 1)

September 8, 2016
By
GoodReads: Webscraping and Text Analysis with R (Part 1)

Inspired by this article about sentiment analysis and this guide to webscraping, I have decided to get my hands dirty by scraping and analyzing a sample of reviews on the website Goodreads. The goal of this project is to demonstrate a complete example, going from data collection to machine learning analysis, and to illustrate a Related PostEuro 2016 analytics:...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)