304 search results for "hadoop"

R and (Software) Relatives

February 18, 2014

Post also available with code executed inline at rpubs.com. O'Reilly recently published the results of a survey of Strata Conference attendees on tool usage and salary. The entire survey is available for download. In the survey results, R was heralded as second only to SQL as a tool used by conference attendees. A chart from the...

There is no Such Thing as Biomedical "Big Data"

February 11, 2014

At the moment, the world is obsessed with “Big Data” yet it sometimes seems that people who use this phrase don’t have a good grasp of its meaning.  Like most good buzz-words, “Big Data” sparks the idea of something grand and complicated, while sounding ordinary enough that listeners feel like they have an intuitive understanding of the concept.  However...

Revolution Analytics announces $999 site licenses for universities and public service organizations

February 4, 2014

by Joseph Rickert

Revolution Analytics is announcing three new programs today that we hope will be modest but positive contributions to data science education and public service analytics. The first new program, the Academic Institution Program (AIP), enables colleges, universities and other educational institutions to obtain a site license for Revolution Analytics' commercial distribution of the R Language, Revolution...

Book review: "Doing Data Science" by Rachel Schutt and Cathy O’Neil

January 23, 2014

by Joseph Rickert

Every once in a while a single book comes to crystallize a new discipline. If books still have this power in the era of electronic media, "Doing Data Science: Straight Talk from the Frontline" by Rachel Schutt and Cathy O’Neil (O'Reilly, 2013) might just be the book that defines data science. "Doing Data Science", which is...

AMPLab Announces Developer Preview of SparkR

January 20, 2014

The team at AMPLab has announced a developer preview of SparkR, an R package enabling R users to run jobs on an Apache Spark cluster. Spark is an open source project that supports distributed in-memory computing for advanced analytics, such as fast queries, machine learning, streaming analytics and graph engines. Spark works with every data format supported in Hadoop,...

R Users, Meet Joyent Manta; Manta Users, Meet R

January 17, 2014

When I joined Joyent last year, I jumped on the opportunity to make R work with Joyent Manta. If you are new to Joyent, we are the High-Performance Cloud Infrastructure Company. If you are new to data analytics, the R language is a rich open-source env...

Guest Blogger Recap

January 13, 2014

We had a marvellous series of guest posts here on the blog over the past few weeks. I'd like to thank all of our guest bloggers for contributing, with particular thanks to Joe Rickert for stepping in as our acting editor for the past 3 weeks. If you were celebrating or vacationing over the holidays,...

Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis

January 13, 2014

Many articles have been written on why R is better than Excel for data analysis. In this post, I will summarize the reasons why R is advantageous in most data...

The post "Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis" appeared first on Fantasy Football Analytics.

Hungarian RUG: lightning talks

January 13, 2014

The only Hungarian R User Group so far, the Budapest Users of R Network (BURN), which already has more than 120 members, is hosting several lightning talks on Wednesday, the 15th of January:

Bence Arató (BI Consulting): Who uses R?
László Bódó: C++ calls in R
Gergely Daróczi (rapporter.net): A Pandoc writer in R
Imre Kocsis (BME MIT): RHadoop. MapReduce in R
László Köleséri (Planiméter):...

Forecasting By Combining Expert Opinion

January 3, 2014

by Michael Helbraun

Michael is a member of the Revolution Analytics Sales Support team. In the following post, he shows how to synthesize a probability distribution from the opinions of multiple experts: an excellent way to construct a Bayesian prior. There are lots of different ways to forecast. Depending on whether there’s historical data, trend, or seasonality, you might choose to...
