306 search results for "hadoop"

Instructions for Installing & Using R on Amazon EC2

April 8, 2013
By

If you’re an R user, you’ve surely heard all the hype around ‘big data’ and how R is commonly used to analyze these volumes of data. One thing that’s often missing from the discussion is HOW to work around issues using big data and R, specifically how to deal with the fact that R stores Instructions for Installing...

Read more »

Lots of data != "Big Data"

March 28, 2013
By
Lots of data != "Big Data"

by Joseph Rickert When talking with data scientists and analysts — who are working with large scale data analytics platforms such as Hadoop — about the best way to do some sophisticated modeling task it is not uncommon for someone to say, "We have all of the data. Why not just use it all?" This sort of comment often...

Read more »

Build a search engine in 20 minutes or less

March 27, 2013
By
Build a search engine in 20 minutes or less

…or your money back. author = "Ben Ogorek"Twitter = "@baogorek"email = paste0(sub("@", "", Twitter), "@gmail.com") Setup Pretend this is Big Data: doc1 <- "Stray cats are running all over the place. I see 10 a day!"doc2 <- "Cats are killers. They...

Read more »

Massive online data stream mining with R

Massive online data stream mining with R

A few weeks ago, the stream package has been released on CRAN. It allows to do real time analytics on data streams. This can be very usefull if you are working with large datasets which are already hard to put in RAM completely, let alone to build some statistical model on it without getting into RAM problems. Most of...

Read more »

Revolution Newsletter: March 2013

March 25, 2013
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full March edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Get Results Fast with our Quick Start Programs: Need help getting value from predictive...

Read more »

Data Science Education gets personal

March 14, 2013
By

by Joseph B. Rickert It is difficult to imagine that there is anyone on the planet with an internet connection and a desire to learn something new who has not at least looked into taking a massive open online course (MOOC). Last Fall, in an 11/4/12 article, the New York Time declared the Year of the MOOC and quoted...

Read more »

In case you missed it: February 2013 Roundup

March 13, 2013
By

In case you missed them, here are some articles from February of particular interest to R users. How to resample from a large data set with RHadoop, and a video introduction to the RHadoop packages. A 90-second video explains: What is Revolution R Enterprise? Jeffrey Stanton has published a free e-book "An Introduction to Data Science" using R. I...

Read more »

Revolution Analytics News Roundup

March 4, 2013
By

Between the Strata conference and various announcements, last week was certainly a busy one for the crew here at Revolution Analytics. So I thought I'd take the opportunity to catch you up on some of the recent media articles you might have missed: The Wall Street Journal interviewed our new VP of Services Neera Talbert on the trend towards...

Read more »

Summary of My First Trip to Strata #strataconf

February 28, 2013
By
Summary of My First Trip to Strata #strataconf

In this post I am goIing to summarize some of the things that I learned at Strata Santa Clara 2013. For now, I will only discuss the conference sessions as I have a much longer post about the tutorial sessions that I am still working on and will post at a later date. I will add to this post...

Read more »

Revolution Newsletter: February 2013

February 25, 2013
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full February edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Case study: Real-Time Marketing Analytics. Online advertising technology company Exelate uses predictive models to...

Read more »