In-Hadoop R-based Analytics coming to Cloudera

August 27, 2013
(This article was first published on Revolutions, and kindly contributed to R-bloggers)

Revolution Analytics has teamed up with Cloudera to bring the scalable data manipulation and statistical modeling algorithms of Revolution R Enterprise to the massively parallel computing environments of CDH3 and CDH4 Hadoop clusters. As ZDNet reports:

Specifically, the upcoming version 7.0 of the Revolution R Enterprise distribution and its ScaleR algorithms will run inside CDH3 and CDH4, eliminating the need to move data out of CDH and into the conventional storage that R code would otherwise require. Avoiding such data movement will allow predictive analytics functionality implemented in R to execute more immediately and quickly, allowing for more real-time analysis and generally better performance even in non-real-time applications.

With the release of Revolution R Enterprise 7 in late 2013, users of Cloudera's Distribution for Apache Hadoop will be able to invoke R-powered predictive models quickly and easily, and to push beyond simple summaries, queries and data visualization to produce game-changing insights from data managed by the Hadoop environment. Data scientists will use the R language to build predictive models such as logistic regression, decision trees, and clustering on data in Hadoop, all without having to learn to write MapReduce in Java, Python or other languages.

You can learn more about this forthcoming integration in today's live webinar, High Performance Predictive Analytics in R and Hadoop, or in the press release linked below.

Revolution Analytics press releases: Revolution Analytics to Support In-Hadoop Big Data Predictive Analytics for Cloudera
