373 search results for "hadoop"

Revolution Analytics joins Microsoft

January 23, 2015
By
Revolution Analytics joins Microsoft

by David Smith, Chief Community Officer On behalf of the entire Revolution Analytics team I am excited to announce that Revolution Analytics is joining forces with Microsoft to bring R to even more enterprises. Microsoft announced today that it will acquire Revolution Analytics. Now, Microsoft might seem like a strange bedfellow for an open-source company, but the company continues...

Read more »

A first look at Spark

January 22, 2015
By
A first look at Spark

by Joseph Rickert Apache Spark, the open-source, cluster computing framework originally developed in the AMPLab at UC Berkeley and now championed by Databricks is rapidly moving from the bleeding edge of data science to the mainstream. Interest in Spark, demand for training and overall hype is on a trajectory to match the frenzy surrounding Hadoop in recent years. Next...

Read more »

REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

January 16, 2015
By
REVIEW OF THE UNIVERSITY OF WASHINGTON DATA SCIENCE CERTIFICATE PROGRAM

When I was looking for Data Science certificate programs back in 2013, there were only a few available and most had only graduated one or two cohorts. Even worse, I could not find a single review for any of them. So, this is my review of the University of Washington Data Science certificate. Background: ...

Read more »

R in Nature, Mashable

December 31, 2014
By
R in Nature, Mashable

R was recently the subject of a feature article in the prestigious science magazine Nature: Programming tools: Adventures with R. Besides being free, R is popular partly because it presents different faces to different users. It is, first and foremost, a programming language — requiring input through a command line, which may seem forbidding to non-coders. But beginners can...

Read more »

R wins a 2014 Bossie Award

December 29, 2014
By

I missed this when it was announced back on September 29, but R won a 2014 Bossie Award for best open-source big-data tools from InfoWorld (see entry number 5): A specialized computer language for statistical analysis, R continues to evolve to meet new challenges. Since displacing lisp-stat in the early 2000s, R is the de-facto statistical processing language, with...

Read more »

[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

December 28, 2014
By
[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

There are a few exciting announcements I would love to share with R community. We feel very honored to host meetup and class offered by Kaggle #1 ranked Data Scientist, Owen Zhang and book author of Applied predictive modeling, Max Kuhn. Featured R experts meetup Featured talk given by Kaggle world ranked #1 Owen Zhang

Read more »

Snowdoop/partools Update

December 27, 2014
By
Snowdoop/partools Update

I’ve put together an updated version of my partools package, including Snowdoop, an alternative to MapReduce algorithms.  You can download it here, version 1.0.1. To review:  The idea of Snowdoop is to create your own file chunking, rather than having something like Hadoop do it for you, and then using ordinary R coding to perform … Continue reading...

Read more »

New ASA Guidelines for Undergraduate Statistics Programs

December 12, 2014
By
New ASA Guidelines for Undergraduate Statistics Programs

by Joseph Rickert The American Statistical Association (ASA) Undergraduate Guidelines Workgroup recently published the report Curriculum Guidelines for Undergraduate Programs in Statistical Science. Although intended for educators setting up or revamping Stats programs at colleges and universities, this concise, 17 page document should be good reading for anyone who wants to take charge of their own education in learning...

Read more »

Thursday Dec 11: Webinar on sports analytics with R and Storm

December 8, 2014
By

A quick heads-up that this Thursday (December 11), Allen Day from MapR and Bill Jacobs from Revolution Analytics will be live presenting a new webinar, Batter Up! Advanced Sports Analytics with R and Storm. The analysis will be of baseball data, but the webinar will be of interest to anyone interested in doing large-scale statistical analysis with R of...

Read more »

Snowdoop, Part II

December 7, 2014
By
Snowdoop, Part II

In my last post, I questioned whether the fancy Big Data processing tools such as Hadoop and Spark are really necessary for us R users.  My argument was that (a) these tools tend to be difficult to install and configure, especially for non-geeks; (b) the tools require learning new computation paradigms and function calls; and … Continue reading...

Read more »