389 search results for "hadoop"

R wins a 2014 Bossie Award

December 29, 2014
By

I missed this when it was announced back on September 29, but R won a 2014 Bossie Award for best open-source big-data tools from InfoWorld (see entry number 5): A specialized computer language for statistical analysis, R continues to evolve to meet new challenges. Since displacing lisp-stat in the early 2000s, R is the de-facto statistical processing language, with...

Read more »

[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

December 28, 2014
By
[NYC] Featured R experts Meetup, R classes and 12 week Data Science Bootcamp

There are a few exciting announcements I would love to share with R community. We feel very honored to host meetup and class offered by Kaggle #1 ranked Data Scientist, Owen Zhang and book author of Applied predictive modeling, Max Kuhn. Featured R experts meetup Featured talk given by Kaggle world ranked #1 Owen Zhang

Read more »

Snowdoop/partools Update

December 27, 2014
By
Snowdoop/partools Update

I’ve put together an updated version of my partools package, including Snowdoop, an alternative to MapReduce algorithms.  You can download it here, version 1.0.1. To review:  The idea of Snowdoop is to create your own file chunking, rather than having something like Hadoop do it for you, and then using ordinary R coding to perform … Continue reading...

Read more »

New ASA Guidelines for Undergraduate Statistics Programs

December 12, 2014
By
New ASA Guidelines for Undergraduate Statistics Programs

by Joseph Rickert The American Statistical Association (ASA) Undergraduate Guidelines Workgroup recently published the report Curriculum Guidelines for Undergraduate Programs in Statistical Science. Although intended for educators setting up or revamping Stats programs at colleges and universities, this concise, 17 page document should be good reading for anyone who wants to take charge of their own education in learning...

Read more »

Thursday Dec 11: Webinar on sports analytics with R and Storm

December 8, 2014
By

A quick heads-up that this Thursday (December 11), Allen Day from MapR and Bill Jacobs from Revolution Analytics will be live presenting a new webinar, Batter Up! Advanced Sports Analytics with R and Storm. The analysis will be of baseball data, but the webinar will be of interest to anyone interested in doing large-scale statistical analysis with R of...

Read more »

Snowdoop, Part II

December 7, 2014
By
Snowdoop, Part II

In my last post, I questioned whether the fancy Big Data processing tools such as Hadoop and Spark are really necessary for us R users.  My argument was that (a) these tools tend to be difficult to install and configure, especially for non-geeks; (b) the tools require learning new computation paradigms and function calls; and … Continue reading...

Read more »

How About a “Snowdoop” Package?

November 26, 2014
By

Along with all the hoopla on Big Data in recent years came a lot of hype on Hadoop.  This eventually spread to the R world, with sophisticated packages being developed such as rmr to run on top of Hadoop. Hadoop made it convenient to process data in very large distributed databases, and also convenient to create … Continue reading...

Read more »

LA R Meetup Summary: Highlights from useR! 2014 – Part 2

November 18, 2014
By
LA R Meetup Summary: Highlights from useR! 2014 – Part 2

Last week the LA R meetup featured another round of 5 speakers each highlighting a...

Read more »

11 new R jobs (for November 18th 2014)

November 18, 2014
By
11 new R jobs (for November 18th 2014)

This is the bimonthly R Jobs post (for 2014-11-18), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seekers, please follow the links below to learn more and apply for your job of interest (or visit previous...

Read more »

Mobility from Mobile Phones

November 12, 2014
By
Mobility from Mobile Phones

I have worked on big data in my work with QuBit in London. In my research I increasingly find the tools I learnt there to be extremely useful. The keywords are smart data management for big data, such as hadoop and hive for querying just the right set of data to work with. I am

Read more »