Real-Time Big Data Analytics: Emerging Architecture

July 19, 2013

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

O'Reilly Media has published a new whitepaper, Real-Time Big Data Analytics: Emerging Architecture. This 32-page document describes the processes and components necessary for getting on-demand information from big-data stores such as Hadoop. It answers the questions "How fast is fast?", "How real is real-time?" and "How big is big?", and provides practical guidance for implementing real-time analytics systems.

The author (Mike Barlow) interviewed a broad range of experts and practitioners, including Justin Erickson (Cloudera), Matei Zaharia (creator of Spark), Nathan Marz (Storm, Cascalog), Dhiraj Rajaram (Mu Sigma) and yours truly (I describe the role of R in the real-time big data analytics stack). The guide offers a broad range of perspectives and distils them into a set of best practices in a clear and approachable way. It's available for download as a free PDF (with registration) at the link below.

O'Reilly: Real-Time Big Data Analytics: Emerging Architecture

