Articles by Randy Zwitch

Adobe: Give Credit. You DID NOT Write RSiteCatalyst.

May 9, 2016 | Randy Zwitch

As an author of several open-source software projects, I’ve taken for granted that people using the software share the same community values as I do. Open-source authors provide their code “free” to the community so that others may benefit without having to re-invent the wheel. The only expectation (but ... [Read more...]

Travis CI: “You Have Too Many Tests LOLZ!”

April 5, 2016 | Randy Zwitch

No output has been received in the last 10m0s, this potentially indicates a stalled build or something wrong with the build itself. As part of getting RSiteCatalyst 1.4.8 ready for CRAN, I’ve managed to accumulate hundreds of testthat tests across 63 test files. Each of these tests runs on Travis ... [Read more...]

RSiteCatalyst Version 1.4.8 Release Notes

April 4, 2016 | Randy Zwitch

For being in RSiteCatalyst retirement, I’m ending up working on more functionality lately ¯_(ツ)_/¯. Here are the changes for RSiteCatalyst 1.4.8, which should be available on CRAN shortly: Segment Stacking RSiteCatalyst now has the ability to take multiple values in the keyword for the Queue* functions. This functionality was ... [Read more...]

Calling RSiteCatalyst From Python

February 22, 2016 | Randy Zwitch

@randyzwitch Do you know if anyone has gotten RSiteCat running in a Jupyter Notebook that ran RPY2? Tired of using 2 different environments — Adam Gitzes (@FootballActuary) February 18, 2016 This will be a very short post, because the only “new” information I’m going to provide is the minimal example to answer the ... [Read more...]

RSiteCatalyst Version 1.4.7 (and 1.4.6.) Release Notes

February 1, 2016 | Randy Zwitch

It seems as though I missed release notes for version RSiteCatalyst 1.4.6, so we’ll do those and RSiteCatalyst 1.4.7 (now on CRAN) and the same time… RSiteCatalyst 1.4.6 This release was mostly tweaking some settings, specifically: Adding a second `top` argument within the Queue* functions for more control on results returned. It ... [Read more...]

A Million Text Files And A Single Laptop

January 28, 2016 | Randy Zwitch

More often that I would like, I receive datasets where the data has only been partially cleaned, such as the picture on the right: hundreds, thousands…even millions of tiny files. Usually when this happens, the data all have the same format (such as having being generated by sensors or ... [Read more...]

RSiteCatalyst Version 1.4.5 Release Notes

August 17, 2015 | Randy Zwitch

It’s only been a month since the last RSiteCatalyst update, and this update is also a pretty minor update in terms of functionality. Set Your Own Endpoint For the overseas users (or companies with weird setups), you can now use the endpoint argument in the SCAuth() function to specify ... [Read more...]

Authenticated API Testing Using Travis CI

August 6, 2015 | Randy Zwitch

As I’ve become more serious about contributing in the open-source community, having quality tests for my packages has been something I’ve spent much more time on than when I was just writing quick-and-dirty code for my own purposes. My most used open-sourced package is RSiteCatalyst, which accesses the ... [Read more...]

Getting Started: Adobe Analytics Clickstream Data Feed

August 4, 2015 | Randy Zwitch

“Well, first you need a TMS and a three-tiered data layer, then some jQuery with a node backend to inject customer data into the page asynchronously if you want to avoid cookie-based limitations with cross-domain tracking and be Internet Explorer 4 compatible…” Blah Blah Blah. There’s a whole cottage industry ... [Read more...]

RSiteCatalyst Version 1.4.4 Release Notes

July 13, 2015 | Randy Zwitch

It’s been about six months since the last RSiteCatalyst update, and this update is really just a single bug fix, but a big bug fix at that! Sparse Data = Opaque Error Messages Numerous people have reported receiving an error message from RSiteCatalyst similar to the following: 'names' attribute [1] must ... [Read more...]

Sessionizing Log Data Using dplyr [Follow-up]

January 13, 2015 | Randy Zwitch

Last week, I wrote a blog post showing how to sessionize log data using standard SQL. The main idea of that post is that if your analytics platform supports window functions (like Postgres and Hive do), you can make quick work out of sessionizing logs. Here’s the winning query:... [Read more...]

RSiteCatalyst Version 1.4.3 Release Notes

January 6, 2015 | Randy Zwitch

It’s a new year, so…new version of RSiteCatalyst on CRAN! For the most part, this release fixes a handful of bugs that weren’t noticed with the prior release 1.4.2 (oops!), but there are pieces of additional functionality. New functionality: Data Feed monitoring For those of you having hourly ... [Read more...]

Review: Data Science at the Command Line

December 15, 2014 | Randy Zwitch

Admission: I didn’t really know how computers worked until around 2012. For the majority of my career, I’ve worked for large companies with centralized IT functions. Like many statisticians, I fell into a comfortable position of learning SAS in a Windows environment, had Ops people to fix any Unix ... [Read more...]

RSiteCatalyst Version 1.4.2 Release Notes

December 3, 2014 | Randy Zwitch

RSiteCatalyst version 1.4.2 is now available on CRAN. This update was primarily bug fixes with one additional feature added. Fixed QueueRanked function to allow multiple SAINT classifications to be specified. This allows for breaking down a SAINT classification with another SAINT classification, such as breaking down tracking codes by marketing channel ... [Read more...]

RSiteCatalyst Version 1.4.1 Release Notes

November 10, 2014 | Randy Zwitch

Changes Version 1.4.1 of RSiteCatalyst is now available on CRAN. There were a handful of bug fixes and new features added, including: Fixed bug in QueueRanked function where only 10 results were returned when requesting multiple element reports. Function now returns up to 50,000 per breakdown (API limit) Created better error message to ... [Read more...]

Evaluating BreakoutDetection

November 6, 2014 | Randy Zwitch

A couple of weeks ago, Twitter open-sourced their BreakoutDetection package for R, a package designed to determine shifts in time-series data. The Twitter announcement does a great job of explaining the main technique for detection (E-Divisive with Medians), so I won’t rehash that material here. Rather, I wanted to ... [Read more...]
1 2 3

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)