339 search results for "hadoop"

Revolution Newsletter: June 2013

June 27, 2013
By

The most recent edition of the Revolution Newsletter came out a couple of weeks ago. In case you missed it, the news section is below, and you can read the full June edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. R is for Analytics:...

Read more »

Time Is on My Side – A Small Example for Text Analytics on a Stream

June 23, 2013
By
Time Is on My Side – A Small Example for Text Analytics on a Stream

Introduction and Background While my last posting was about recommendation in the context of Location Based Social Networks there are also other interesting topics regarding the analysis of unstructured data. The most established one is probably Text Analytics/Mining focusing on all sorts of text data.For me, coming from spatial analysis, these topic is relatively new but I couldn’t help noticing...

Read more »

PivotalR Improves the Scalability and Performance of In-Database Analytics

June 18, 2013
By
PivotalR Improves the Scalability and Performance of In-Database Analytics

One of the greatest challenges while working with big datasets concerns the need to move information out of storage for analysis. To this end, the recent announcement of PivotalR 0.1 extends Pivotal HD's capabilities, allowing users of the statistical programming language R to perform in-database analytics without leaving the command line.

Read more »

Bringing R to the Enterprise – new white paper available

June 10, 2013
By

Check out this new white paper entitled "Bringing R to the Enterprise -  A Familiar R Environment with Enterprise-Caliber Performance, Scalability, and Security." In this white paper, we begin with "Beyond the Laptop" exploring the ability to run R code in the database, working with CRAN packages at the database server, operationalizing R analytics, and...

Read more »

Mahout for R Users

June 9, 2013
By
Mahout for R Users

I have a few posts coming up on Apache Mahout so I thought it might be useful to share some notes. I came at it as primarily an R coder with some very rusty Java and C++ somewhere in the back of my head so that will be my point of reference. I’ve also included … Continue reading...

Read more »

A Big Data introduction

June 5, 2013
By

Since R uses the computer RAM, it may handle only rather small sets of data. Nevertheless, there are some packages that allow to treat larger volumes and the best solution is to connect R with a Big Data environment. This … Continue reading →

Read more »

Ryan Sheftel: "R on the Trading Desk"

May 30, 2013
By

by Joseph Rickert In a post last week, I offered some first impressions about R/Finance 2013. Apparently, I was way off in estimating that 30% of the attendees were academics. The R/Finance organizers were quick to point out that percentage of academics attending the conference has been a constant 10% over the years; and this year was no different....

Read more »

Stepping up to Big Data with R and Python: A Mind Map of All the Packages You Will Ever Need

May 29, 2013
By
Stepping up to Big Data with R and Python: A Mind Map of All the Packages You Will Ever Need

On May 8, we kicked off the transformation of R Users DC to Statistical Programming DC   (SPDC) with a meetup at iStrategyLabs in Dupont Circle. The meetup, titled “Stepping up to big data with R and Python,” was an … Continue reading →The post Stepping up to Big Data with R and Python: A Mind Map...

Read more »

Sentiment analysis finds trouble in the Enron emails

May 24, 2013
By
Sentiment analysis finds trouble in the Enron emails

The Enron email dataset, collected during the FERC investigation of the Enron financial scandal, represents the largest publicly available set of emails. This makes theman ideal testbed for sentiment analysis algorithms. Ikanow's Andrew Strite used the open-source Infinit.e framework and a Hadoop cluster to generate sentiment scores for all of the Enron emails, and then used R to manipulate...

Read more »

Big Data Analytics in R – the tORCH has been lit!

May 22, 2013
By

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »