496 search results for "hadoop"

Integrating R with Apache Hadoop

May 27, 2016
By
Integrating R with Apache Hadoop

Integrating R to work on Hadoop is to address the requirement to scale R program to work with petabyte scale data. The primary goal of this post is to elaborate different techniques for integrating R with Hadoop. Approach 1: Using R and Streaming APIs in Hadoop In order to integrate an R function with Hadoop Related Post

Read more »

Nina Zumel and John Mount part of R Day at Strata + Hadoop World in San Jose 2016

January 17, 2016
By

Nina Zumel and I are honored to have been invited to be part of Strata + Hadoop World in San Jose 2016 R Day organized by RStudio and O’Reilly. We have written a lot on the topic of model validation in R and we are very excited to distill it down to an exciting tutorial. … Continue reading...

Read more »

Making it easy to use RHadoop on HDInsight Hadoop clusters

September 25, 2015
By
Making it easy to use RHadoop on HDInsight Hadoop clusters

The RHadoop packages make it easy to connect R to Hadoop data (rhdfs), and write map-reduce operations in the R language (rmr2) to process that data using the power of the nodes in a Hadoop cluster. But getting the Hadoop cluster configured, with R and all the necessary packages installed on each node, hasn't always been so easy. But...

Read more »

Ofuro, start H2O on Hadoop from R

August 21, 2015
By
Ofuro, start H2O on Hadoop from R

tl;dr I made a simple functionality to start H2O on hadoop from R. You can easily start H2O on hadoop, run your analytics and close all the processes without occuppying Hadoop nodes and memory all the time. I like to take a bath. Fill a bath and warm up in there is a perfect refreshment after a hard working day. I...

Read more »

Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

July 9, 2015
By
Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

A long time ago in 1991 I had my first programming course (Modula 2) at the Vrije University in Amsterdam. I spend months behind a terminal with a green monochrome display doing the programming exercises using VI. Do you remeber Shift … Continue reading →

Read more »

Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

July 9, 2015
By
Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

A long time ago in 1991 I had my first programming course (Modula 2) at the Vrije University in Amsterdam. I spend months behind a terminal with a green monochrome display doing the programming exercises using VI. Do you remember Shift … Continue reading →

Read more »

News from UseR!2015 – the RHadoop tutorial

July 1, 2015
By
News from UseR!2015 – the RHadoop tutorial

by Andrie de Vries Today is the first day of UseR!2015 conference in Aalborg in Northern Denmark. But yesterday was a day packed with 16 tutorials on a range of interesting topics. I submitted a proposal many months ago to run a session on using R in Hadoop and was very happy to selected to run a session in...

Read more »

Using Hadoop with R: It Depends.

June 19, 2015
By

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics In the course of working with our Hadoop users, we are often asked, what's the best way to integrate R with Hadoop? The answer, in nearly all cases is, It depends. Alternatives ranging from open source R on workstations, to parallelized commercial products like Revolution R Enterprise and many steps...

Read more »

The 2015 Strata + Hadoop World London

May 12, 2015
By
The 2015 Strata + Hadoop World London

By Mark Sellors, Mango UK On Tuesday 5th of May, O’Reilly Media and Cloudera, a distributor of a Hadoop based big data platform, brought their ‘Strata + Hadoop World‘ conference to London. The conference features a mixture of Data Science, … Continue reading →

Read more »

Using Hadoop Streaming API to perform a word count job in R and C++

February 25, 2015
By

by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Introduction In a recent blog post we explained how we managed to set up a working Hadoop environment on a few CentOS7 machines. To test the installation, let’s play…Read more ›

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de









ODSC

CRC R books series













Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)