Blog Archives

Using R to Gain Insights into the Emotional Journeys in War and Peace

December 1, 2016
By
Using R to Gain Insights into the Emotional Journeys in War and Peace

by Wee Hyong Tok, Senior Data Scientist Manager at Microsoft How do you read a novel in record time, and gain insights into the emotional journey of main characters, as they go through various trials and tribulations, as an exciting story unfolds from chapter to chapter? I remembered my experiences when I start reading a novel, and I get...

Read more »

Calculating AUC: the area under a ROC Curve

November 22, 2016
By
Calculating AUC: the area under a ROC Curve

by Bob Horton, Microsoft Senior Data Scientist Receiver Operating Characteristic (ROC) curves are a popular way to visualize the tradeoffs between sensitivitiy and specificity in a binary classifier. In an earlier post, I described a simple “turtle’s eye view” of these plots: a classifier is used to sort cases in order from most to least likely to be positive,...

Read more »

SAS to R Migration for Financial Data: Lessons and Examples

November 14, 2016
By
SAS to R Migration for Financial Data: Lessons and Examples

by Lixun Zhang (Data Scientist), Ye Xing (Senior Data Scientist) and Tao Wu (Principal Data Scientist Manager), all at Microsoft Editor's Note: To learn more about migrating from SAS to R, there will be a live webinar presented by Lixun and Ye tomorrow (Tuesday, November 15). Register to attend the webinar here. R has been gaining in popularity among...

Read more »

Data Manipulation with sparklyr on Azure HDInsight

November 8, 2016
By
Data Manipulation with sparklyr on Azure HDInsight

by Ali Zaidi, Data Scientist at Microsoft # Apache Spark and a Tale of APIs Spark is an exceptionally popular processing engine for distributed data. Dealing with data in distributed storage and programming with concurrent systems often requires learning complicated new paradigms and techniques. Statisticans and data scientists familiar wtih R are unlikely to have much experience with such...

Read more »

Sharing our R Programs — With Style

October 25, 2016
By

by Graham Williams, Director of Data Science, Microsoft Programming is an art and a way we express ourselves. As we write our programs we should keep in mind that someone else is very likely to be reading it. We can facilitate the accessibility of our programs through a clear presentation of the messages we are sharing. As data scientists...

Read more »

Estimating the value of a vehicle with R

October 18, 2016
By
Estimating the value of a vehicle with R

by Srini Kumar, Director of Data Science at Microsoft We tend to think of R and other such ML tools only in the context of the workplace, to do “weighty” things aimed at saving millions. A little judicious use of R may help us hugely in our personal lives too. The ideas of regression, classification trees etc. can be...

Read more »

Building Scalable Data Pipelines with Microsoft R Server and Azure Data Factory

October 4, 2016
By
Building Scalable Data Pipelines with Microsoft R Server and Azure Data Factory

by Udayan Kumar, Data Scientist at Microsoft Beginning in 2016, Microsoft rolled out a preview of Microsoft R Server (MRS) for Azure HDInsight clusters. This service provides a preconfigured instance of R server with Spark/Hadoop that can be provisioned within minutes. Recent blog posts (by Max Kaznady and David Smith) have highlighted how to use and tune this service...

Read more »

How to choose the right tool for your data science project

September 22, 2016
By
How to choose the right tool for your data science project

by Brandon Rohrer, Principal Data Scientist, Microsoft R or Python? Torch or TensorFlow? (or MXNet or CNTK)? Spark or map-reduce? When we're getting started on a project, the mountain of tools to choose from can be overwhelming. Sometimes it makes me feel small and bewildered, like Alice in Wonderland. Luckily, the Cheshire Cat cut to the heart of the...

Read more »

A predictive maintenance solution template with SQL Server R Services

September 13, 2016
By
A predictive maintenance solution template with SQL Server R Services

by Jaya Mathew, Data Scientist at Microsoft By using R Services within SQL Server 2016, users can leverage the power of R at scale without having to move their data around. Such a solution is beneficial for organizations with very sensitive, big data which cannot be hosted on any public cloud but does most of their coding in R....

Read more »

Introducing R for Big Data with PivotalR

June 4, 2014
By
Introducing R for Big Data with PivotalR

Wouldn't it be great if there was a way to harness the familiarity and usability of a tool like R, and at the same time take advantage of the performance and scalability benefits of in-database/in-Hadoop computation? We're happy to announce PivotalR, a package that translates R code into SQL for processing, is available to download from GitHub today.

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)