356 search results for "hadoop"

A first look at Distributed R

October 23, 2014
By
A first look at Distributed R

by Joseph Rickert One of the most interesting R related presentations at last week’s Strata Hadoop World Conference in New York City was the session on Distributed R by Sunil Venkayala and Indrajit Roy, both of HP Labs. In short, Distributed R is an open source project with the end goal of running R code in parallel on data...

Read more »

Statistics doesn’t have to be so hard: simulate!

October 17, 2014
By

My second-favourite keynote from yesterday's Strata Hadoop World conference was this one, from Pinterest's John Rauser. To many people (especially in the Big Data world), Statistics is a series of complex equations, but a just a little intuition goes a long way to really understanding data. John illustrates this wonderfully using an example of data collected to determine whether...

Read more »

Introducing Revolution R Open and Revolution R Plus

October 15, 2014
By

For the past 7 years, Revolution Analytics has been the leading provider of R-based software and services to companies around the globe. Today, we're excited to announce a new, enhanced R distribution for everyone: Revolution R Open. Revolution R Open is a downstream distribution of R from the R Foundation for Statistical Computing. It's built on the R 3.1.1...

Read more »

In case you missed it: September 2014 Roundup

October 8, 2014
By

In case you missed them, here are some articles from September of particular interest to R users. Norm Matloff argues that T-tests shouldn't be part of the Statistics curriculum and questions the "star system" for p-values in R. A nice video introduction to the dplyr package and the %>% operator, presented by Kevin Markham. An animation of police militarization...

Read more »

R and Data Science Webinar

October 2, 2014
By

by Joseph Rickert Recently, I had the opportunity to present a webinar on R and Data Science. The challenge with attempting this sort of thing is to say something interesting that does justice to the subject while being suitable for an audience that may include both experienced R users and curious beginners. The approach I settled on had three...

Read more »

Data Science Toolbox Survey Results… Surprise! R and Python win

September 24, 2014
By
Data Science Toolbox Survey Results… Surprise! R and Python win

This is a re-publication of a blog post from a blog I created not long before...

Read more »

Build Predictive Model on Big data: Using R and MySQL Part-1

September 21, 2014
By
Build Predictive Model on Big data: Using R and MySQL Part-1

Wellcome to the series blog posts. Since long time, I am writing post on Machine learning with R. Today I am gonna discuss on big data problem while fitting machine learning on it and its solution using MySQL and R. Before we jump directly to solution, let us discuss about big data little bit. (You The post Build...

Read more »

R at Conferences this Fall

September 11, 2014
By

by Joseph Rickert The days are getting shorter here in California and the summer R conferences UseR!2014 and JSM are behind us, but there are still some very fine conferences for R users to look forward to before the year ends. DataWeek starts in San Francisco on September 15th. I will be conducting a bootcamp for new R users,...

Read more »

Visualizing Website Pathing With Sankey Charts

September 10, 2014
By
Visualizing Website Pathing With Sankey Charts

In my prior post on visualizing website structure using network graphs, I referenced that network graphs showed the pairwise relationships between two pages (in a bi-directional manner). However, if you want to analyze how your visitors are pathing through your site, you can visualize your data using a Sankey chart. Visualizing Single Page-to-Next Page Pathing Related posts:

Read more »

Hortonworks Seminar Series: The Modern Data Architecture

September 3, 2014
By

As more companies explore the benefits that Hadoop may provide, the opportunities to better understand the technology are myriad and unequal. As a provider of in-Hadoop analytics, Revolution Analytics is participating in the coming Hortonworks seminar series. We will be on site to discuss how to deploy R-based analytics within Hadoop clusters using Revolution R Enterprise. The seminar series...

Read more »