345 search results for "hadoop"

jpmml and R (Free Webinar)

July 28, 2014
By
jpmml and R (Free Webinar)

This free, global webinar will provide an introduction to jpmml, the world’s leading open-source PMML scoring engine currently being utilized by companies such as Airbnb to rapidly deploy predictive models into production. Webinar Format: – What is PMML? – Building … Continue reading →

Read more »

Preparing Big Data for Analysis in R

July 15, 2014
By
Preparing Big Data for Analysis in R

by Yaniv Mor, Co-founder & CEO of Xplenty How do you get Big Data ready for R? Gigabytes or terabytes of raw data may need to be combined, cleaned, and aggregated before they can be analyzed. Processing such large amounts of data used to require installing Hadoop on a cluster of servers, not to mention coding MapReduce jobs in...

Read more »

Reflections on John Chambers’ UserR! 2014 Keynote Address

July 10, 2014
By
Reflections on John Chambers’ UserR! 2014 Keynote Address

by Joseph Rickert John Chambers opened UseR! 2014 by describing how the R language grew out of early efforts to give statisticians easier access to high quality statistical software. In 1976 computational statistics was a very active field, but most algorithms were compiled as Fortran subroutines. Building models with this software was not a trivial process. First you had...

Read more »

Revolution Analytics: the R company since 2007

July 2, 2014
By

Revolution Analytics, founded in 2007, was the first company devoted to the R project. Since then, we've been behind several R initiatives, including the RHadoop project and the network of R user groups around the world. I gave this short presentation today at the useR! 2014 conference in Los Angeles with some of the highlights from Revolution Analytics from...

Read more »

R/Finance 2014 Review

June 30, 2014
By

It's been more than a month since R/Finance 2014, and my job has finally slowed down enough to allow me to write down my thoughts (though I'm writing this over two days during my train to and from Chicago).The comments below are based on my personal ex...

Read more »

Maybe I Don’t Really Know R After All

June 26, 2014
By
Maybe I Don’t Really Know R After All

Lately, I’ve been feeling that I’m spreading myself too thin in terms of programming languages. At work, I spend most of my time in Hive/SQL, with the occasional Python for my smaller data. I really prefer Julia, but I’m alone at work on that one. And since I maintain a package on CRAN (RSiteCatalyst), I frequently spend Related posts:

Read more »

Jun 26-27, 2014 – Introduction to Data Science with R in NYC

June 26, 2014
By
Jun 26-27, 2014 – Introduction to Data Science with R in NYC

You can either register from eventbrite or our school site NYC Data Science Academy. Date: Thursday/Friday , June 26th and 27th, 2014 Time:  9:00am to 5:00pm Location: 500 7th Ave, 17th Floor, glass door classroom, New York, NY 10018 NYC Data Science Academy, training subbrand of SupStat (Official Training partner with RStudio Inc) is hosting our... Read more »

Using Julia As A ‘Glue’ Language

June 24, 2014
By

While much of the focus in the Julia community has been on the performance aspects of Julia relative to other scientific computing languages, Julia is also perfectly suited to ‘glue’ together multiple data sources/languages. In this blog post, I will cover how to create an interactive plot using Gadfly.jl, by first preparing the data using Related posts:

Read more »

Five Hard-Won Lessons Using Hive

June 12, 2014
By

I’ve been spending a ton of time lately on the data engineering side of ‘data science’, so I’ve been writing a lot of Hive queries. Hive is a great tool for querying large amounts of data, without having to know very much about the underpinnings of Hadoop. Unfortunately, there are a lot of things about Five Hard-Won...

Read more »

The 7th China R Conference in Beijing

June 4, 2014
By
The 7th China R Conference in Beijing

The 7th China R Conference in Beijing was held on May 24th ~May 25th in Renmin University of China. SupStat is really happy and honored to sponsor and attend this meeting. This is the largest ever R conference in China with 1814 registrations online and even 50 more requests of attendance with the help of special... Read more »