Blog Archives

Vega.jl Rebooted – Now with 100% More Pie and Donut Charts!

May 21, 2015

Mmmmm, chartjunk! Rebooting Vega.jl: Recently, I’ve found myself without a project to hack on, and I’ve always been interested in learning more about browser-based visualization. So I decided to revive the work that John Myles White had done in building Vega.jl nearly two years ago. And since I’ll be giving

Read more »

Sessionizing Log Data Using data.table [Follow-up #2]

January 20, 2015

Thanks to user dnlbrky, we now have a third way to sessionize log data for any arbitrary timeout period (see methods 1 and 2), this time using data.table from R along with magrittr for piping: I agree with dnlbrky that this feels a little better than the dplyr method for heavy SQL users

Read more »

Sessionizing Log Data Using dplyr [Follow-up]

January 13, 2015

Last week, I wrote a blog post showing how to sessionize log data using standard SQL. The main idea of that post is that if your analytics platform supports window functions (as Postgres and Hive do), you can make quick work of sessionizing logs. Here’s the winning query: One nested sub-query and two window functions are

Read more »

RSiteCatalyst Version 1.4.3 Release Notes

January 6, 2015

It’s a new year, so…new version of RSiteCatalyst on CRAN! For the most part, this release fixes a handful of bugs that weren’t noticed with the prior release 1.4.2 (oops!), but there are pieces of additional functionality. New functionality: Data Feed monitoring. For those of you having hourly or daily data feeds delivered via FTP,

Read more »

Review: Data Science at the Command Line

December 15, 2014

Admission: I didn’t really know how computers worked until around 2012. For the majority of my career, I’ve worked for large companies with centralized IT functions. Like many statisticians, I fell into a comfortable position of learning SAS in a Windows environment, had Ops people to fix any Unix problems I’d run into and DBAs to

Read more »

RSiteCatalyst Version 1.4.2 Release Notes

December 3, 2014

RSiteCatalyst version 1.4.2 is now available on CRAN. This update consists primarily of bug fixes, with one additional feature. Fixed the QueueRanked function to allow multiple SAINT classifications to be specified. This allows for breaking down a SAINT classification by another SAINT classification, such as breaking down tracking codes by marketing channel and by campaign. Fixed

Read more »

RSiteCatalyst Version 1.4.1 Release Notes

November 10, 2014

Changes: Version 1.4.1 of RSiteCatalyst is now available on CRAN. There were a handful of bug fixes and new features added, including: Fixed a bug in the QueueRanked function where only 10 results were returned when requesting multiple element reports. The function now returns up to 50,000 results per breakdown (API limit). Created a better error message to inform user

Read more »

Evaluating BreakoutDetection

November 6, 2014

A couple of weeks ago, Twitter open-sourced their BreakoutDetection package for R, a package designed to determine shifts in time-series data. The Twitter announcement does a great job of explaining the main technique for detection (E-Divisive with Medians), so I won’t rehash that material here. Rather, I wanted to see how this package works relative

Read more »

Visualizing Website Pathing With Sankey Charts

September 10, 2014

In my prior post on visualizing website structure using network graphs, I noted that network graphs show the pairwise relationships between two pages (in a bi-directional manner). However, if you want to analyze how your visitors are pathing through your site, you can visualize your data using a Sankey chart. Visualizing Single Page-to-Next Page Pathing

Read more »

Visualizing Website Pathing With Network Graphs

September 8, 2014

Last week, version 1.4 of RSiteCatalyst was released, and now it’s possible to get site pathing information directly within R. Now, it’s easy to create impressive-looking network graphs from your Adobe Analytics data using RSiteCatalyst and d3Network. In this blog post, I will cover simple and force-directed network graphs, which show the pairwise relationships between pages.

Read more »