Using R and Hadoop to analyze VOIP data

November 8, 2010
By

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

Last month, the newest member of Revolution's engineering team, Saptarshi Guha, gave a presentation at Hadoop World 2010 on using R and Hadoop to analyze 1.3 billion voice-over-IP packets to identify calls and measure call quality. Saptarshi, of course, is the author of RHIPE, which lets R programmers write map-reduce algorithms in the Hadoop framework without needing to learn Java. With R running on each Hadoop node, Saptarshi used R's data analysis functions (such as robust regression) to process almost 100 Gb of data in just a few minutes.

The slides for Saptarshi's talk are now available to view at the Hadoop World website (linked below), or you can download a PDF version (7.3Mb).

Hadoop World 2010: Voice over IP: Studying Traffice Characteristics for Quality of Service using R and Hadoop

 

To leave a comment for the author, please follow the link and comment on his blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Tags: ,

Comments are closed.