331 search results for "hadoop"

Orbitz: R has become the data-mining tool of choice

May 17, 2012
By

Sameer Chopra, vice president of Advanced Analytics at Orbitz Worldwide, wrote recently in Analytics magazine about the changing landscape of processes, software and systems for statistical modelers. In a section on "Big Data and Open Source Analytics", Chopra lays out the reasons why the R language "has become the data-mining tool of choice for machine learners": R has very...

Read more »

Revolution Newsletter: May 2012

May 16, 2012
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full May edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. New R Training Courses Announced. Three new R courses from leading R experts are...

Read more »

In case you missed it: April 2012 Roundup

May 10, 2012
By

In case you missed them, here are some articles from April of particular interest to R users. Information Age published a feature article on R, describing how new graduates are driving adoption of R in industry. Bob Muenchen has updated his list of R package equivalents to SAS and SPSS procedures. A history of Data Science, including Bill Cleveland's...

Read more »

Heartbeat of a Cycling City: Bixi data at Hack/Reduce

May 8, 2012
By
Heartbeat of a Cycling City: Bixi data at Hack/Reduce

The recent Hack/Reduce hackathon in Montreal was a tonne of fun. Our team tackled a data set of consisting of Bixi (Montreal’s bicycle share system) station states at one minute temporal resolution. We used Hadoop and mapreduce to pull out some features of user behaviours. One of the things we extracted was the flux at

Read more »

Online resources for handling big data and parallel computing in R

May 6, 2012
By
Online resources for handling big data and parallel computing in R

by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing … Continue reading →

Read more »

Yes, you need more than just R for Big Data Analytics

May 2, 2012
By

Douglas Merrill, former CIO/VP of Engineering at Google, writes in Forbes about using the R language for data analysis: Most folks with math-oriented graduate degrees will have written something in R, a non-commercial option for your big data analysis. So, great graduates from great graduate schools know great tools. His post is titled 'R Is Not Enough For "Big...

Read more »

Information Age: graduates driving industry adoption of R

April 30, 2012
By

Information Age recently published a feature article devoted to the R language, "Putting the R in analytics". Says author Pete Swabey: Already popular in universities, there are signs that R is finding increasing adoption in the enterprise. This promises to lower the barriers of entry for advanced analytics, and may accelerate the mathemitisation of business management. The article includes...

Read more »

Revolution Newsletter: April 2012

April 20, 2012
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full April edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Spring Webinar Series. Our Spring Webinar Series features presentations from Revolution Analytics staff and...

Read more »

Get your large SQL data in ff swiftly

Get your large SQL data in ff swiftly

The ff package is great when you are working with large data in R. Data in corporate environments are usually not that large that a Hadoop system is needed to handle it but the data are mostly large enough to make R choke on it's RAM.  T...

Read more »

Revolution Analytics Spring Webinar Series

April 17, 2012
By

The webinar team at Revolution Analytics has put together a great program over the next couple of months. With a mix of guest speakers and Revolution Analytics staff, this series will cover topics as diverse as Big Data with R and Hadoop, integrating R with MS Office, spatial statistics with R, data mining with R, retail marketing analytics, and...

Read more »