500 search results for "hadoop"

Documentation for Microsoft R Server now online

May 16, 2016
By
Documentation for Microsoft R Server now online

If you've been thinking about trying the big-data capabilities of Microsoft R Server but wanted to check out the documentation first, you're in luck: the complete Microsoft R Server documentation is now available on MSDN (and is accessible to anyone). There's lots to explore here, but a few highlights you might want to check out include: Getting Started with...

Read more »

R 3.3.0 is another motivation for Docker

May 12, 2016
By

Have you ever encountered R packages versioning issues when one application required different dependent packages versions than other? Have you ever got stuck with your project because of wrong pre-installed software versions on machine on which you should run your code? Or maybe you had heavy adventures with installing R software on a new machine because...

Read more »

Bike Rental Demand Estimation with Microsoft R Server

May 10, 2016
By
Bike Rental Demand Estimation with Microsoft R Server

by Katherine Zhao, Hong Lu, Zhongmou Li, Data Scientists at Microsoft Bicycle rental has become popular as a convenient and environmentally friendly transportation option. Accurate estimation of bike demand at different locations and different times would help bicycle-sharing systems better meet rental demand and allocate bikes to locations. In this blog post, we walk through how to use Microsoft...

Read more »

In case you missed it: April 2016 roundup

May 9, 2016
By

In case you missed them, here are some articles from April of particular interest to R users. Lukasz Piwek recreates classic graphs from Tufte's 'The Visual Display of Quantitative Information' in R. A preview of upcoming R conferences in Europe. Andrie de Vries updates the data on R package growth on CRAN, and finds a segmented regression model with...

Read more »

Exploring NYC Taxi Data with Microsoft R Server and HDInsight

April 19, 2016
By
Exploring NYC Taxi Data with Microsoft R Server and HDInsight

As I mentioned yesterday, Microsoft R Server now available for HDInsight, which means that you can now run R code (including the big-data algorithms of Microsoft R Server) on a managed, cloud-based Hadoop instance. Debraj GuhaThakurta, Senior Data Scientist, and Shauheen Zahirazami, Senior Machine Learning Engineer at Microsoft, demonstrate some of these capabilities in their analysis of 170M taxi...

Read more »

A scalable data science platform with Microsoft R Server and Spark

April 18, 2016
By

If you want to train a statistical model on very large amounts of data, you'll need three things: a storage platform capable of holding all of the training data, a computational platform capable of efficently performing the heavy-duty mathematical computations required, and a statistical computing language with algorithms that can take advantage of the storage and computation power. Microsoft...

Read more »

Answers to FAQ about SparkR for R users

April 5, 2016
By
Answers to FAQ about SparkR for R users

Many people keep asking me whether I have tried SparkR, is it worth using, is it sexy or WHAT is it at all. I felt that creating frequently asked questions (FAQ) in the field of WHAT is that Spark/SparkR? would help many R Scientists to understand this Big Data Buzz-tool. I have gathered information from the...

Read more »

AirbnB uses R to scale data science

April 5, 2016
By
AirbnB uses R to scale data science

Airbnb, the property-rental marketplace that helps you find a place to stay when you're travelling, uses R to scale data science. Airbnb is a famously data-driven company, and has recently gone through a period of rapid growth. To accommodate the influx of data scientists (80% of whom are proficient in R, and 64% use R as their primary data...

Read more »

Help improve treatment for brain injuries using machine learning and R

April 4, 2016
By
Help improve treatment for brain injuries using machine learning and R

The field of neuroscience -- the study of brains and the nervous system -- has taken some major leaps in recent years. Scientists can now gather real-time electrical activity from the brain during actions and thoughts, which is helping to pinpoint the exact location of brain lesions caused by strokes, and is leading to promising treatments for epilepsy and...

Read more »

A bit on the F1 score floor

April 2, 2016
By
A bit on the F1 score floor

At Strata+Hadoop World “R Day” Tutorial, Tuesday, March 29 2016, San Jose, California we spent some time on classifier measures derived from the so-called “confusion matrix.” We repeated our usual admonition to not use “accuracy” as a project goal (business people tend to ask for it as it is the word they are most familiar … Continue reading...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series











Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)