NYT on Big Data and R

October 24, 2011

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

In the New York Times' "Bits" blog today, Quentin Hardy offers recollections on Big Data talks at the recent Web 2.0 Summit. He begins with a definition of Big Data:

Big Data is really about … the benefits we will gain by cleverly sifting through it to find and exploit new patterns and relationships. You see it now in things like Facebook ads, which are put in front of you because the posts you have read and contributed to (which Facebook’s algorithms get to examine as the price of this “free” service) indicate you might be ready to buy the advertised good.

The article includes applications of big data analytics at various companies: ad placement at Google; credit card transaction analysis (according to the CEO of TrialPay, the value of the transaction data exceeds the transaction fee credit companies like Visa charge merchants); and inferring information from the semantic web at search start-up Domo. (By the way, here's a great presentation on using R to mine the semantic web, from Chris Davis and Alfredas Chmieliauskas of the Amsterdam R Users Group.)

Speaking of R, the NYT article ends with a mention of new tools for Big Data analytics: MapReduce, NoSQL and R:

There are an uncountable number of data-mining start-ups in the field: MapReduce and NoSQL for managing the stuff; and the open-source R statistical programming language, for making predictions about what is likely to happen next, based on what has happened before. 

For example, R is used at social networking sites like Foursquare and OMGPOP to make predictions based on user transaction data.

Bits Blog: The Big Business of ‘Big Data’

