Blog Archives

Become an effective data hacker with the R-Hadoop stack

September 24, 2014
By

In discussion with several data scientists, Will Stanton (a data scientist with Return Path) learned that a common concern is: what software should I be using? There are many options out there, but what is the best platform to be an effective "data hacker"? Will recommends using a technology stack with R and Hadoop, which allows data scientists "to...

Read more »

Around the world in 80k miles

September 22, 2014
By
Around the world in 80k miles

You're probably familiar with the classic Travelling Salesman problem: given (say) 20 cities, what is shortest route you can take that passes through all 20 cities and returns to the starting point? It's a difficult problem to solve, because you need to try all possible routes to find the minimum, and there are a LOT of possibilities. For a...

Read more »

Webinar September 25: Data Science with R

September 19, 2014
By

A quick heads up that if you'd like to get a great introduction to doing data science with the R language, Joe Rickert will be giving a free webinar next Thursday, September 25: Data Science with R. Regular readers of the blog will be familiar with Joe's posts on this topic. A few recent examples include posts on comparing...

Read more »

Applications of R presentations at Dataweek

September 17, 2014
By

I'm speaking at the DataWeek conference in San Francisco today. My talk follows Skylar Lyon from Accenture — I'm really looking forward to hearing how he uses Revolution R Enterprise with Teradata Database to run R in-database with 400 million rows of data. Update: Here are Skylar's slides. The slides for my talk on other companies' applications of R...

Read more »

New members for R-core and R Foundation

September 16, 2014
By

The R Foundation for Statistical Computing, the Vienna-based non-profit organization that oversees the R Project, has just added several new "ordinary members". (Ordinary members participate in R Foundation meetings and provide guidance to the project.) The new members are: Dirk Eddelbuettel, Torsten Hothorn, Marc Schwartz, Hadley Wickham, and Achim Zeileis, Martin Morgan and Michael Lawrence. The R Core group,...

Read more »

Using Reddit’s JSON API to analyze post popularity

September 15, 2014
By
Using Reddit’s JSON API to analyze post popularity

Graduate student Clay McLeod decided to find out what makes a post on the social-sharing site Reddit popular. These are the questions he seeks to answer: What’s in a post? Reddit pulls in around 115 million unique visitors each month, amassing a staggering 5 billion page views per month. For a long time, I’ve wondered what factors draw people...

Read more »

Google uses R to calculate ROI on advertising campaigns

September 12, 2014
By
Google uses R to calculate ROI on advertising campaigns

Google has just released a new package for R: CausalImpact. Amongst many other things, this package allows Google to resolve the classical conundrum: how can we asses the impact of an intervention (for example, the effect of an advertising campaign on website clicks) when we can't know what would have happened if we hadn't run the campaign? For a...

Read more »

More presentations from useR! 2014

September 10, 2014
By

DataScience.LA has posted a great recap of the latest LA R meetup, which in turn was a recap of presentations from the useR! 2014 conference. Follow that link to review slides from the event, whith summaries of useR! 2014 related to R and Python; Finance; dplyr; R books; SalesForce and R AnalyticFlow. DataScience.LA has also posted more videos from...

Read more »

In case you missed it: August 2014 Roundup

September 5, 2014
By

In case you missed them, here are some articles from August of particular interest to R users: R is the most popular software in the KDNuggets poll for the 4th year running. The frequency of R user group meetings continues to rise, and there are now 147 R user groups worldwide. A video interview with David Smith, Chief Community...

Read more »

Hortonworks Seminar Series: The Modern Data Architecture

September 3, 2014
By

As more companies explore the benefits that Hadoop may provide, the opportunities to better understand the technology are myriad and unequal. As a provider of in-Hadoop analytics, Revolution Analytics is participating in the coming Hortonworks seminar series. We will be on site to discuss how to deploy R-based analytics within Hadoop clusters using Revolution R Enterprise. The seminar series...

Read more »