Blog Archives

My Shiny Dashboard, Milwaukee Beer

March 1, 2019
By
My Shiny Dashboard, Milwaukee Beer

Milwaukee Beer - Inspired by my Job Hunt I’m excited to launch my latest Shiny app - “Milwaukee Beer” - which I made to learn shinydashboard. Due to my decision to return to the USA and hunt for a career in data, I decided to add another project to my portfolio. Milwaukee Beer is a metric tracking dashboard that provides...

Read more »

Predicting Churn Using Tree Models

December 31, 2018
By
Predicting Churn Using Tree Models

Today I want to predict churn using data from a hypothetical telecom company. Although it isn’t real life data, it is based on real life data. The data are spread across 19 columns — 14 continuous, 4 categorical, and the outcome variable for prediction - “churn”. The dataset is small, with 3333 rows for training and 1667 for testing. Before...

Read more »

Linear Classification Models – Hepatic Dataset

December 2, 2018
By
Linear Classification Models – Hepatic Dataset

This post is following exercise 1 in Chapter 12 of Applied Predicative Modeling. Here I use the machine learning package CARET in R to make classification models; in particular, the linear classification models discussed in Chapter 12. The dataset in question is about hepatic injury (liver damage). It includes a dataframe of biological related predictors of liver damage bio, a...

Read more »

First App – Halo 5 Stats

September 20, 2018
By
First App – Halo 5 Stats

“Halo 5 Stats” is my first ever Shiny app, which is themed around the the popular sci-fi shooter Halo 5. It combines beatiful data with Halo’s beautiful graphics - something that Halo fans and data enthusiasts undoubtedly love. You can use my app...

Read more »

Part II: Chinese Classics’ Word/Network Plots

June 7, 2018
By
Part II: Chinese Classics’ Word/Network Plots

This is a continuation in my series of exploratory text analysis of 3 Chinese classic works. In the previous post, I calculated word counts for each book, and visualized common words using bar charts. This time, I’d like to examine word use across the texts with network visualization. The goal is to help see what’s common and what’s different...

Read more »

Plotting Word Bigrams with 3 Chinese Classics

May 31, 2018
By
Plotting Word Bigrams with 3 Chinese Classics

In the last post, we saw frequencies of the most common words in the Analects, Zhuangzi, and Mozi texts. The faceted plot did an excellent job of capturing a generic “theme” of each text. However, I wondered how the results might change when plotting bigrams (2 word combinations of adjacent words) as opposed to single values. This is where I...

Read more »

A Tidytext Analysis of 3 Chinese Classics

May 28, 2018
By
A Tidytext Analysis of 3 Chinese Classics

For a long time I’ve admired the tidytext package and its wonderful companion book Text Mining with R. After reading it I thought, “Why not undertake a project of Chinese text analysis?” I am deeply interested in Chinese philosophy but I decided to keep the analysis narrow by selecting just three works - The Analects, Zhuangzi, and the Mozi. Following...

Read more »

Ctextclassics, my First Package

May 16, 2018
By

My latest update is a milestone! I have authored my first ever R package which is an API caller for ctext.org. Ctext hosts numerous pre-modern Chinese texts and my package makes them available to you. The scope is broad, but think philosophical works in Confucianism, Daoism, Legalism, military doctrines, history compilations, works in medicine, and many more. The three main...

Read more »

Scraping for a Booklist of the Chinese Classics

Scraping for a Booklist of the Chinese Classics

Last week I was considering a project that would be interesting and unique. I decided I would like to do a text analysis on classical Chinese texts, but wasn’t sure what kind of analysis regarding which texts. I decided to keep it small - and use five of the “core” Chinese classics - The Analects, The Mengzi, Dao De...

Read more »

On Relocating to Github/Netlify

Deep, labored breathing Hello everyone, this is the opening post on my new blog, which I’m relocating from Wordpress to GitHub Pages and Netlify. It’s so nice I’ve given it a name - because nice things have names! But, why was I panting? The relocation effort wasn’t easy. Why did I follow through with it? Becuase it is worth the effort....

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)