501 search results for "hadoop"

Apache Spark integrated with Microsoft R Server for Hadoop

June 28, 2016
By

by Bill Jacobs, Microsoft Advanced Analytics Product Marketing They say that time is infinite. Seem to me data is fast becoming the same. Or perhaps it's becoming true that our thirst for speed is providing eternal job security to computer scientists who can deliver it. Apache Spark, one of the Apache Foundation's fastest-growing open source projects, delivers new levels...

Read more »

Integrating R with Apache Hadoop

May 27, 2016
By
Integrating R with Apache Hadoop

Integrating R to work on Hadoop is to address the requirement to scale R program to work with petabyte scale data. The primary goal of this post is to elaborate different techniques for integrating R with Hadoop. Approach 1: Using R and Streaming APIs in Hadoop In order to integrate an R function with Hadoop Related Post

Read more »

How to use SparkR in Cloudera Hadoop

Suppose you are an avid R user, and you would like to use SparkR in Cloudera Hadoop; unfortunately, as of the latest CDH version (5.7), SparkR is still not supported (and, according to a recent discussion in the Cloudera forums, we shouldn’t expect this to happen anytime soon). Is there anything  you can do? Well, indeed there is. In...

Read more »

Nina Zumel and John Mount part of R Day at Strata + Hadoop World in San Jose 2016

January 17, 2016
By

Nina Zumel and I are honored to have been invited to be part of Strata + Hadoop World in San Jose 2016 R Day organized by RStudio and O’Reilly. We have written a lot on the topic of model validation in R and we are very excited to distill it down to an exciting tutorial. … Continue reading...

Read more »

Manipulating Hive tables with Oracle R connectors for Hadoop

November 12, 2015
By

In this post, we’ll have a look at how easy it is to manipulate Hive tables using Oracle R connectors for Hadoop (ORCH, presently known as Oracle R Advanced Analytics for Hadoop – ORAAH). We will use the weblog data from Athens Datathon 2015, which we have already loaded in a Hive table named weblogs, as described in more...

Read more »

Making it easy to use RHadoop on HDInsight Hadoop clusters

September 25, 2015
By
Making it easy to use RHadoop on HDInsight Hadoop clusters

The RHadoop packages make it easy to connect R to Hadoop data (rhdfs), and write map-reduce operations in the R language (rmr2) to process that data using the power of the nodes in a Hadoop cluster. But getting the Hadoop cluster configured, with R and all the necessary packages installed on each node, hasn't always been so easy. But...

Read more »

Ofuro, start H2O on Hadoop from R

August 21, 2015
By
Ofuro, start H2O on Hadoop from R

tl;dr I made a simple functionality to start H2O on hadoop from R. You can easily start H2O on hadoop, run your analytics and close all the processes without occuppying Hadoop nodes and memory all the time. I like to take a bath. Fill a bath and warm up in there is a perfect refreshment after a hard working day. I...

Read more »

Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

July 9, 2015
By
Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

A long time ago in 1991 I had my first programming course (Modula 2) at the Vrije University in Amsterdam. I spend months behind a terminal with a green monochrome display doing the programming exercises using VI. Do you remeber Shift … Continue reading →

Read more »

News from UseR!2015 – the RHadoop tutorial

July 1, 2015
By
News from UseR!2015 – the RHadoop tutorial

by Andrie de Vries Today is the first day of UseR!2015 conference in Aalborg in Northern Denmark. But yesterday was a day packed with 16 tutorials on a range of interesting topics. I submitted a proposal many months ago to run a session on using R in Hadoop and was very happy to selected to run a session in...

Read more »

Using Hadoop with R: It Depends.

June 19, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics In the course of working with our Hadoop users, we are often asked, what's the best way to integrate R with Hadoop? The answer, in nearly all cases is, It depends. Alternatives ranging from open source R on workstations, to parallelized commercial products like Revolution R Enterprise and many steps...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)