When do you need all the data for Big Analytics?

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

In the 2012 edition of the SAP Sybase Capital Markets Guide, Revolution Analytics' Senior Advisor for Products and Strategy (and former CEO) Norman Nie writes about the “Five Benefits of Big Analytics”. (You can also read his article at Enterprise Innovation.) Norman makes the argument that while sampling and aggregation are often useful ways of handling very large data sets for statistical analysis, there are nonetheless several situations where using all of the data in the analysis is beneficial and/or important. They include situations where you need to:

  1. Make Predictions with Data Mining
  2. Deploy More Powerful Predictive Models
  3. Find and Understand Rare Events
  4. Extract and Analyze “Low-Incidence Populations”
  5. Move Beyond “Statistical Significance”

Not coincidentally, many of the big-data analysis features of Revolution R Enterprise have been designed to support these classes of data analysis. You can read the full article by dowloading the 2012 Sybase Capital Markets Guide from the link below (it's on pages 16-19). An expanded version of the article is also available as a white paper from Revolution Analytics.

SAP Sybase: Capital Markets Guide 2012

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)