389 search results for "hadoop"

Some R Conferences in 2015

February 19, 2015
By

by Joseph Rickert For the past few years, the Strata + Hadoop World Conference in San Jose has kicked off my personal conference season. With its focus on Data Science, Strata always seems to present some interesting R related talks, and I am looking forward to the various events over the next couple of days. But, Strata and other...

Read more »

SAS to R Migration

February 16, 2015
By

By Andy Nicholls, Head of Consulting (UK) Why do it? Mango has been involved in an increasing number of engagements where customers are seeking to migrate from SAS to R.  There are a number of different business drivers for these … Continue reading →

Read more »

The HP Workshop on Distributed Computing in R

February 12, 2015
By
The HP Workshop on Distributed Computing in R

by Joseph Rickert In the last week of January, HP Labs in Palo Alto hosted a workshop on distributed computing in R that was organized by Indrajit Roy (Principal Researcher, HP) and Michael Lawrence (Genentech and R-core member). The goal was to bring together a small group of R developers with significant experience in parallel and distributed computing to...

Read more »

Enhancing R for Distributed Computing

February 10, 2015
By
Enhancing R for Distributed Computing

A summary of a recent workshop at HP Labs addressed “Distributed Computing in R”

Read more »

What to expect from Strata Conference 2015? An empirical outlook.

February 9, 2015
By
What to expect from Strata Conference 2015? An empirical outlook.

In one week, the 2015 edition of Strata Conference (or rather: Strata + Hadoop World) will open its doors to data scientists and big data practitioners from all over the world. What will be the most important big data technology trends for this year? As last year, I ran an analysis on the

Read more »

Quickcheck: Randomized unit testing for R

February 4, 2015
By

Hadley Wickham's testthat package has been a boon for R package authors, making it easy to write tests to verify that your code is working directly, and alerting you when you make changes to your code that inadvertently breaks things. For the RHadoop project, though, developer Antonio Piccolboni needed a different testing framework, that included the possibility of writing...

Read more »

Revolution Analytics joins Microsoft

January 23, 2015
By
Revolution Analytics joins Microsoft

by David Smith, Chief Community Officer On behalf of the entire Revolution Analytics team I am excited to announce that Revolution Analytics is joining forces with Microsoft to bring R to even more enterprises. Microsoft announced today that it will acquire Revolution Analytics. Now, Microsoft might seem like a strange bedfellow for an open-source company, but the company continues...

Read more »

A first look at Spark

January 22, 2015
By
A first look at Spark

by Joseph Rickert Apache Spark, the open-source, cluster computing framework originally developed in the AMPLab at UC Berkeley and now championed by Databricks is rapidly moving from the bleeding edge of data science to the mainstream. Interest in Spark, demand for training and overall hype is on a trajectory to match the frenzy surrounding Hadoop in recent years. Next...

Read more »

REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

January 16, 2015
By
REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

When I was looking for Data Science certificate programs back in 2013, there were only a few available and most had only graduated one or two cohorts. Even worse, I could not find a single review for any of them. So, this is my review of the University of Washington Data Science certificate. Background: ...

Read more »

R in Nature, Mashable

December 31, 2014
By
R in Nature, Mashable

R was recently the subject of a feature article in the prestigious science magazine Nature: Programming tools: Adventures with R. Besides being free, R is popular partly because it presents different faces to different users. It is, first and foremost, a programming language — requiring input through a command line, which may seem forbidding to non-coders. But beginners can...

Read more »