Big Data

Webinar: Leveraging R in Hadoop Environments

September 6, 2011 | David Smith

On Wednesday September 21, Revolution Analytics' CTO David Champagne will give a live webinar introducing three new open-source packages for R and Hadoop, which make it possible to work with Hadoop data in R, and bring in-database R analytics to Hadoop. Here are the details: Date: Wednesday, September 21st Time: 10:00AM ... [Read more...]

Big Analytics: Closing the "clue gap" with Big Data

August 31, 2011 | David Smith

There's been an growing discussion over the past couple of years on the topic of Big Data: how to deal with the situation when you have more data than can be conveniently managed and analyzed by traditional software tools. But Big Data has little intrinsic value in its own right: ... [Read more...]

GigaOm article on R, Big Data and Data Science

July 18, 2011 | David Smith

I'm really pleased that an article I wrote, "5 real-world uses of big data", has been published in the widely-read technology blog GigaOm. In the article, I review five examples of using data science techniques and R to make sense of some large real-world data sets: Drew Conway's analysis of the ... [Read more...]

Big-Data PCA: 50 years of stock data

June 17, 2011 | Sherry Lamonica

In this post, Revolution engineer Sherry LaMonica shows us how to use the RevoScaleR big-data package in Revolution R Enterprise to do principal components analysis on 50 years of stock market data -- ed. Principal components analysis, or PCA, seeks to find a set of orthogonal axes such that the first ... [Read more...]

The Big Analytics Revolution starts with R

June 15, 2011 | David Smith

Thanks to everyone who attended our webinar The 'Big Analytics' Revolution Starts with R yesterday. If you missed the live session, you can download the presentation slides (PDF) and the 30-minute replay video (WMV) from the Revolution Analytics website. The presentation focuses on the isse of Big Data, and how ... [Read more...]

K-Means Clustering on Big Data

June 7, 2011 | Joseph Rickert

In this post Joseph Rickert demonstrates how to build a classification model on a large data set with the RevoScaleR package. A script file for use with Revolution R Enterprise to recreate the analysis below is at the end of the post, and can also be downloaded here -- ed. ... [Read more...]

The Netflix Prize, Big Data, SVD and R

May 31, 2011 | David Smith

One of the key data analysis tools that the BellKor team used to win the Netflix Prize was the Singular Value Decomposition (SVD) algorithm. As a file on disk, the Neflix Prize data (a matrix of about 480,000 members' ratings for about 18,000 movies) was about 65Gb in size -- too large ... [Read more...]

My Experience at Hadoop Summit 2010 #hadoopsummit

June 30, 2010 | Ryan

This week I had the opportunity the trek up north to Silicon Valley to attend Yahoo’s Hadoop Summit 2010. I love Silicon Valley. The few times I’ve been there the weather was perfect (often warmer than LA), little to no traffic, no road rage and people overall seem friendly ...
[Read more...]

Lessons Learned from EC2

March 24, 2010 | Ryan

A week or so ago I had my first experience using someone else’s cluster on Amazon EC2. EC2 is the Amazon Elastic Compute Cloud. Users set up a virtual computing platform that runs on Amazon’s servers “in the cloud.” Amazon EC2 is not just another cluster. EC2 allows ...
[Read more...]

Review of R in NYT and GDAT

January 8, 2009 | Neil Gunther

GDAT instructor, Jim Holtman, pointed me at this review of R in yesterday's New York Times. It definitely puts SAS on the defensive.Update: Another piece in the tech section of NYT.If you want to know how to apply R to performance data, sign up for th...
[Read more...]
1 3 4 5

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)