Blog Archives

Interactive association rules exploration app

November 29, 2015

Contents: Features, How to get, How to use, Screenshots, Code. In a previous post, I wrote about what I use association rules for and mentioned a Shiny application I developed to explore and visualize rules. This post is about that app. The app is mainly a wrapper around the...

Read more »

Advanced Tips and Tricks with data.table

August 30, 2015

Tips and tricks learned along the way
1. DATA STRUCTURES & ASSIGNMENT
=> Columns of lists
=> Suppressing intermediate output with {}
=> Fast looping with set
=> Using shift to lead/lag vectors and lists
=> Create multiple columns with := in one statement
...

Read more »

Render reports directly from R scripts

March 4, 2015

Workflow: This post is really about workflow, specifically a data-science workflow, although it should be relevant for others. It will probably resonate most (if at all) with those who have some (mostly positive) experience generating reports from R Markdown files with knitr, but who might have some gripes. Maybe not gripes, maybe just feelings of uncertainty over whether it...

Read more »

Blogging with Jekyll and R Markdown using knitr

January 24, 2015

Use knitr: One way to blog using R and Jekyll is to copy and paste every code chunk, output and plot into a plain vanilla Markdown file by hand. This is cumbersome, especially for plots, which need to be saved as images and embedded. I tried this for my first few posts. It was clunky. A more seamless...

Read more »

Latent Dirichlet Allocation – under the hood

January 16, 2015

LDA for mortals: I’ve been intrigued by LDA topic models for a few weeks now. I actually built one before I really knew what I was doing. That kind of frightens and excites me. On one hand, LDA provides rich output which is easy for the humanist researcher to interpret. On the other hand, shouldn’t I...

Read more »

New York Times Article Search API to MongoDB

January 5, 2015

Contents: Motivation, Accessing NYT API, Extracting and parsing the article body text, Writing to MongoDB, Pipeline, Results. Motivation: I’ve learned a little about a lot of different corners of the text mining and NLP world over the last few years… which sometimes makes me feel like I know nothing for certain....

Read more »

Scraping with Selenium

December 10, 2014

If you’ve ever…
- felt like you’re playing Simon Says with mouse clicks when repeatedly extracting data in chunks from a front-end interface to a database on the web, well, you probably are. There’s probably a better solution – Selenium.
- used XML or httr in R or urllib2 in Python, you’ve probably encountered the situation where the source code you’ve scraped for...

Read more »

DIY building an R package

November 19, 2014

Why create a personal R package? As a consulting data scientist, I write a lot of R code in a lot of different places, physically and virtually: different computers, servers, environments, VPNs, operating systems, all of the above. Even when I have the luxury of working with the same client (and computing environment) for enough time to work on...

Read more »

Deploying Shiny apps with shinyapps.io

October 10, 2014

So I’ve been messing around with Shiny for a year or so now. It’s a great tool and getting greater. The good: the capability to rapidly build an interactive visualization, with the full universe of R packages to choose from for the computation engine and visualization options; the ability to code and house the visualization/app in the same language/place as the code...

Read more »

How accurate is Next Bus III: getting the answers

September 16, 2014

So now that we’ve collected data from the web and wrangled it into something useful, what can we say about how accurate Next Bus is? This post is about the “making of the analysis”, which might be rather boring to non-data geeks (normal people). If you’re just interested in the story and the pictures,...

Read more »
