205 search results for "hadoop"

The next generation of parallel R

November 2, 2011
By
The next generation of parallel R

In view of open-source parallel computing with R this week presents a big step to the future. R 2.14.0 was released at October 31th, 2011. Now, R base ships with a parallel computing package called “parallel”.  library(parallel) It combines advantages of the packages multicore and snow and it contains support for multiple RNG streams. The

Read more »

Last chance to enter the $20,000 "Applications of R" contest

October 31, 2011
By

Don't forget that midnight tonight (Pacific Daylight Time) is the deadline to submit your draft entry for the Applications of R in Business Contest, with $20,000 in prizes up for grabs from Revolution Analytics. You'll still be able to edit your entry during the public review period (until November 30), but no new entries will be accepted for the...

Read more »

"Anyone planning to work with Big Data ought to learn Hadoop and R"

October 25, 2011
By

Dan Woods at Forbes interviewed LinkedIn's Daniel Tunkelang about the rise of data science and on building data science teams. When asked how students today should prepare themselves to be data scientists, Tunkelang gives some good advice: When we built the data science team at LinkedIn a few years ago, we looked for raw talent, assuming that smart people...

Read more »

One week left to enter the $20,000 "Applications of R" contest

October 24, 2011
By
One week left to enter the $20,000 "Applications of R" contest

The deadline to enter the "Applications of R in Business" contest is just a week away. To qualify for $20,000 in prizes from Revolution Analytics, your entry must be submitted to inside-r.org by midnight PST on October 31. Note that this doesn't have to be your final submission: as long as you've entered a draft version, you can still...

Read more »

ACM Data Mining Camp 2011: Report

October 18, 2011
By

(By Joseph Rickert.) In San Jose topics like big data, map reduce, predictive models, mobile analytics and crowdsourcing draw a crowd even on a Saturday. So it turned out that the ACM data Mining Camp and "un-conference" was a very "happening" way to spend a Saturday. Over 500 people attended the event at the Ebay "Town Hall" on North...

Read more »

Revolution Newsletter: October 2011

October 17, 2011
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full October edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Applications of R Contest: Deadline October 31. Revolution Analytics is offering $20,000 in prizes...

Read more »

Implementing K-means clustering for Hadoop in R and Java

October 14, 2011
By
Implementing K-means clustering for Hadoop in R and Java

At the Bay Area R User Group meeting this week, Antonio Piccolboni gave an overview of the design goals and implementation of the RHadoop Project packages that connect Hadoop and R: rhdfs, rhbase and rmr: (The image above was captured from Antionio's slides.) The most revealing part of the talk for me was the comparison of implementing the K-means...

Read more »

Tomorrow: ACM Data Mining Camp at eBay

October 14, 2011
By

If you're in the Bay Area, tomorrow would be a great day to head down to San José for the ACM Data Mining Camp. Hundreds of data scientists, data hackers and data miners will be there for a fun "unconference", with talks and practical sessions organized on the spot according to demand. Revolution Analytics is proud to be a...

Read more »

In case you missed it: September Roundup

October 7, 2011
By

In case you missed them, here are some articles from September of particular interest to R users. The deadline to enter the "R Applications" contest with $20,000 in prizes is October 31. The RHadoop Project, a new collection of open-source R packages from Revolution Analytics, makes it possible to write map-reduce jobs in R to analyze huge data sets...

Read more »

Oracle’s Big Data Appliance to include R

October 3, 2011
By

At the Oracle OpenWorld conference in San Francisco today, Oracle announced the new Oracle Big Data Appliance, "a new engineered system that includes an open source distribution of Apache™ Hadoop™, Oracle NoSQL Database, Oracle Data Integrator Application Adapter for Hadoop, Oracle Loader for Hadoop, and an open source distribution of R." Oracle's foray into the Hadoop and NoSQL spaces...

Read more »