377 search results for "hadoop"

The HP Workshop on Distributed Computing in R

February 12, 2015
By
The HP Workshop on Distributed Computing in R

by Joseph Rickert In the last week of January, HP Labs in Palo Alto hosted a workshop on distributed computing in R that was organized by Indrajit Roy (Principal Researcher, HP) and Michael Lawrence (Genentech and R-core member). The goal was to bring together a small group of R developers with significant experience in parallel and distributed computing to...

Read more »

Enhancing R for Distributed Computing

February 10, 2015
By
Enhancing R for Distributed Computing

A summary of a recent workshop at HP Labs addressed “Distributed Computing in R”

Read more »

What to expect from Strata Conference 2015? An empirical outlook.

February 9, 2015
By
What to expect from Strata Conference 2015? An empirical outlook.

In one week, the 2015 edition of Strata Conference (or rather: Strata + Hadoop World) will open its doors to data scientists and big data practitioners from all over the world. What will be the most important big data technology trends for this year? As last year, I ran an analysis on the

Read more »

Quickcheck: Randomized unit testing for R

February 4, 2015
By

Hadley Wickham's testthat package has been a boon for R package authors, making it easy to write tests to verify that your code is working directly, and alerting you when you make changes to your code that inadvertently breaks things. For the RHadoop project, though, developer Antonio Piccolboni needed a different testing framework, that included the possibility of writing...

Read more »

Revolution Analytics joins Microsoft

January 23, 2015
By
Revolution Analytics joins Microsoft

by David Smith, Chief Community Officer On behalf of the entire Revolution Analytics team I am excited to announce that Revolution Analytics is joining forces with Microsoft to bring R to even more enterprises. Microsoft announced today that it will acquire Revolution Analytics. Now, Microsoft might seem like a strange bedfellow for an open-source company, but the company continues...

Read more »

A first look at Spark

January 22, 2015
By
A first look at Spark

by Joseph Rickert Apache Spark, the open-source, cluster computing framework originally developed in the AMPLab at UC Berkeley and now championed by Databricks is rapidly moving from the bleeding edge of data science to the mainstream. Interest in Spark, demand for training and overall hype is on a trajectory to match the frenzy surrounding Hadoop in recent years. Next...

Read more »

REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

January 16, 2015
By
REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

When I was looking for Data Science certificate programs back in 2013, there were only a few available and most had only graduated one or two cohorts. Even worse, I could not find a single review for any of them. So, this is my review of the University of Washington Data Science certificate. Background: ...

Read more »

R in Nature, Mashable

December 31, 2014
By
R in Nature, Mashable

R was recently the subject of a feature article in the prestigious science magazine Nature: Programming tools: Adventures with R. Besides being free, R is popular partly because it presents different faces to different users. It is, first and foremost, a programming language — requiring input through a command line, which may seem forbidding to non-coders. But beginners can...

Read more »

R wins a 2014 Bossie Award

December 29, 2014
By

I missed this when it was announced back on September 29, but R won a 2014 Bossie Award for best open-source big-data tools from InfoWorld (see entry number 5): A specialized computer language for statistical analysis, R continues to evolve to meet new challenges. Since displacing lisp-stat in the early 2000s, R is the de-facto statistical processing language, with...

Read more »

[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

December 28, 2014
By
[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

There are a few exciting announcements I would love to share with R community. We feel very honored to host meetup and class offered by Kaggle #1 ranked Data Scientist, Owen Zhang and book author of Applied predictive modeling, Max Kuhn. Featured R experts meetup Featured talk given by Kaggle world ranked #1 Owen Zhang

Read more »