418 search results for "hadoop"

Combining Hadoop, Spark, R, SparkR and Shiny…. and it works :-)

July 9, 2015

A long time ago, in 1991, I had my first programming course (Modula 2) at the Vrije University in Amsterdam. I spent months behind a terminal with a green monochrome display doing the programming exercises using VI. Do you remember Shift …


News from UseR!2015 – the RHadoop tutorial

July 1, 2015

by Andrie de Vries. Today is the first day of the UseR!2015 conference in Aalborg in Northern Denmark. But yesterday was a day packed with 16 tutorials on a range of interesting topics. I submitted a proposal many months ago to run a session on using R in Hadoop and was very happy to be selected to run a session in...


Using Hadoop with R: It Depends.

June 19, 2015

by Bill Jacobs, Director Technical Sales, Microsoft Advanced Analytics. In the course of working with our Hadoop users, we are often asked: what's the best way to integrate R with Hadoop? The answer, in nearly all cases, is: it depends. Alternatives ranging from open source R on workstations, to parallelized commercial products like Revolution R Enterprise and many steps...


The 2015 Strata + Hadoop World London

May 12, 2015

By Mark Sellors, Mango UK. On Tuesday 5th of May, O’Reilly Media and Cloudera, a distributor of a Hadoop-based big data platform, brought their ‘Strata + Hadoop World’ conference to London. The conference features a mixture of Data Science, …


Using Hadoop Streaming API to perform a word count job in R and C++

February 25, 2015

by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Introduction: In a recent blog post we explained how we managed to set up a working Hadoop environment on a few CentOS7 machines. To test the installation, let’s play…


Hadoop and Neo4j

February 23, 2015

Hadoop is being widely used for processing big data and Neo4j is a popular open-source graph database. When doing social network analysis on big data, a “natural” thought is to use them together. Unfortunately, Neo4j cannot work directly on HDFS …


R for in-Hadoop Analytics: with Big Data Developer meetup Group

October 26, 2014

We were honoured to have a joint event with the Big Data Developer Meetup Group, where we were introduced to IBM's BigR package for in-Hadoop analytics. Mr. Rafeal Coss and Mr. Brandon MacKenzie demonstrated the workings of BigR, the integration of R into Hadoop using IBM BigInsights. You can download the slides of this presentation by clicking here. BigR allows R users to...


Find us at Strata Conference and Hadoop World 2014!

October 2, 2014

SupStat Analytics and Transwarp Technologies will be at the 2014 Strata Conference and Hadoop World showcasing the power of Hadoop and Spark computing with R analytics. We’re excited to be presenting to the data science world the Transwarp Data Hub, an integrated storage, processing, and analytics platform that delivers up to 100 times faster performance


Meet us at R Day and at the Strata+Hadoop World NYC Oct 15-17, 2014

September 30, 2014

Are you headed to Strata? It’s just around the corner! We particularly hope to see you at R Day on October 15, where we will cover a raft of current topics that analysts and R users need to pay attention to. The R Day tutorials come from Hadley Wickham, Winston Chang, Garrett Grolemund, J.J. Allaire, and


Become an effective data hacker with the R-Hadoop stack

September 24, 2014

In discussions with several data scientists, Will Stanton (a data scientist with Return Path) learned that a common concern is: what software should I be using? There are many options out there, but what is the best platform for being an effective "data hacker"? Will recommends using a technology stack with R and Hadoop, which allows data scientists "to...
