353 search results for "hadoop"

How About a “Snowdoop” Package?

November 26, 2014

Along with all the hoopla on Big Data in recent years came a lot of hype on Hadoop. This eventually spread to the R world, with sophisticated packages such as rmr being developed to run on top of Hadoop. Hadoop made it convenient to process data in very large distributed databases, and also convenient to create …

LA R Meetup Summary: Highlights from useR! 2014 – Part 2

November 18, 2014

Last week the LA R meetup featured another round of 5 speakers each highlighting a...

11 new R jobs (for November 18th 2014)

November 18, 2014

This is the bimonthly R Jobs post (for 2014-11-18), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seeker, please follow the links below to learn more and apply for your job of interest (or visit previous...

Mobility from Mobile Phones

November 12, 2014

I worked on big data during my time with QuBit in London. In my research I increasingly find the tools I learnt there to be extremely useful. The key idea is smart data management for big data, using tools such as Hadoop and Hive to query just the right set of data to work with. I am

SBS documentary “The Age of Big Data”

November 8, 2014

by Yanchang Zhao, RDataMining.com

“Data is becoming a powerful and most valuable commodity in 21st century. It is leading to scientific insights and new ways of understanding human behaviour. Data can also make you rich. Very rich.” (SBS documentary) …

Learn about Revolution R Open in live webinar, November 12

November 7, 2014

On Wednesday next week, I'll be presenting a live webinar to introduce Revolution R Open and several other open source projects from Revolution Analytics. In the webinar I'll describe: the enhancements included in Revolution R Open; the Reproducible R Toolkit and the checkpoint package; how to call R from other applications with DeployR Open; how to run R in...

Introducing Revolution R Enterprise V 7.3

November 5, 2014

by Bill Jacobs

Revolution R Enterprise is the industry's first R-based analytics platform that supports a variety of parallel, grid and clustered systems such as Hadoop, Teradata database and Platform LSF Linux grids. Last year, we enhanced Revolution R Enterprise (RRE) to support big data systems, with support for Hadoop. We continued expansion of RRE in 2014, adding support...

A first look at Distributed R

October 23, 2014

by Joseph Rickert

One of the most interesting R-related presentations at last week’s Strata Hadoop World Conference in New York City was the session on Distributed R by Sunil Venkayala and Indrajit Roy, both of HP Labs. In short, Distributed R is an open source project with the end goal of running R code in parallel on data...

Statistics doesn’t have to be so hard: simulate!

October 17, 2014

My second-favourite keynote from yesterday's Strata Hadoop World conference was this one, from Pinterest's John Rauser. To many people (especially in the Big Data world), Statistics is a series of complex equations, but just a little intuition goes a long way to really understanding data. John illustrates this wonderfully using an example of data collected to determine whether...

Introducing Revolution R Open and Revolution R Plus

October 15, 2014

For the past 7 years, Revolution Analytics has been the leading provider of R-based software and services to companies around the globe. Today, we're excited to announce a new, enhanced R distribution for everyone: Revolution R Open. Revolution R Open is a downstream distribution of R from the R Foundation for Statistical Computing. It's built on the R 3.1.1...
