Blog Archives

Association Rules and Market Basket Analysis with R

April 8, 2015
By

In today's data-oriented world, just about every retailer has amassed a huge database of purchase transaction. Each transaction consists of a number of products that have been purchased together. A natural question that you could answer from this database is: What products are typically purchased together? This is called Market Basket Analysis (or Affinity Analysis). A closely related question...

Read more »

DeployR 7.4 released

April 3, 2015
By

A new version of DeployR, the server-based framework that provides simple and secure R integration for application developers, is now available. (If you're new to DeployR, take a look at the DeployR Overview or download the white paper, Using DeployR to Solve the R Integration Problem.) This update brings several new features, including: New R Session Process Controls, which...

Read more »

A minor update: Revolution R Open 8.0.2

April 1, 2015
By

Revolution R Open 8.0.2 is now available from MRAN. If you're already using Revolution R Open, you won't find any major changes. This release fixes a couple of bugs, includes a new version of the checkpoint package, and splits the installation into two parts on Windows and Linux (with a separate installer for the multithreaded MKL Math libraries, which...

Read more »

The most common R error messages

March 30, 2015
By

R has something of a reputation for generating, shall we say, obscure error messages like this: Error in model.frame.default(formula = y ~ female + DNC + SE_region + : could not find function "function (object, ...) nobject" One tip for dealing with error messages is to ignore everything between "Error in" and the colon: unless you are running a...

Read more »

Replay: Reproducible data analysis with the checkpoint package

March 27, 2015
By

Thanks to all who attended my webinar earlier this week, Reproducibility with Revolution R Open and the Checkpoint Package. If you missed the live session, you can catch up with the slides and video replay which I've embedded below. If you just want to check out the demo of the checkpoint package, it starts at 18:30 in the video...

Read more »

Participate in the 2015 Rexer Data Mining Survey

March 25, 2015
By
Participate in the 2015 Rexer Data Mining Survey

Since 2007, Karl Rexer has been collecting data on the tools, skills and practices of statisticians and data miners. Over the years, his semi-annual Data Miner Survey has expanded in scope, and now includes research on the topics of data science, big data, and analytics applications on business. Unlike research conducted by big-name analyst firms, Karl shares the results...

Read more »

Tomorrow, 10AMPT: Live webinar on "checkpoint" package

March 23, 2015
By

A quick heads-up that tomorrow (Tuesday) at 10AM Pacific Time I'll be giving a live (and free) webinar: Reproducibility with Revolution R Open and the Checkpoint Package. I'll show you how to use the latest version of the checkpoint package to write R scripts without having to worry about unexpected package updates on CRAN. Here's the full abstract: Good...

Read more »

NYC is a city that does sleep, a bit

March 20, 2015
By
NYC is a city that does sleep, a bit

The On Broadway project collected more than 600,000 photographs taken near Broadway in New York City during a six-month period in 2014. If you're in New York, you can explore the images in an interactive installation at the New York Public Library though the end of this year. You can also explore them in your browser using this online...

Read more »

Updated checkpoint package: faster reproducibility with more feedback

March 18, 2015
By

A new version of the checkpoint package for R has just been released on CRAN. With the checkpoint package, you can easily: Write R scripts or projects using CRAN package versions from a specific point in time; Share R scripts with others that will automatically install the appropriate package versions (no need to manually install CRAN packages); Write R...

Read more »

The impact of open source software on the data science revolution

March 16, 2015
By

It's hard to overstate the role of open-source software in the data science revolution. Tools like Hadoop, Spark, R, and Python are essential parts of the modern data science toolkit. These tools are likewise part of the solutions built by the Consulting Services group at Revolution Analytics. Our VP of Professional Services, Neera Talbert, shares her view on the...

Read more »