When do you need all the data for Big Analytics?

April 18, 2012

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

In the 2012 edition of the SAP Sybase Capital Markets Guide, Revolution Analytics' Senior Advisor for Products and Strategy (and former CEO) Norman Nie writes about the "Five Benefits of Big Analytics". (You can also read his article at Enterprise Innovation.) Norman argues that while sampling and aggregation are often useful ways of handling very large data sets for statistical analysis, there are nonetheless several situations where using all of the data in the analysis is beneficial or important. These are situations where you need to:

  1. Make Predictions with Data Mining
  2. Deploy More Powerful Predictive Models
  3. Find and Understand Rare Events
  4. Extract and Analyze “Low-Incidence Populations”
  5. Move Beyond “Statistical Significance”

Not coincidentally, many of the big-data analysis features of Revolution R Enterprise have been designed to support these classes of data analysis. You can read the full article by downloading the 2012 Sybase Capital Markets Guide from the link below (it's on pages 16-19). An expanded version of the article is also available as a white paper from Revolution Analytics.

SAP Sybase: Capital Markets Guide 2012

