Articles by Guest Blogger

Employee Retention with R Based Data Science Accelerator

March 9, 2017 | Guest Blogger

by Le Zhang (Data Scientist, Microsoft) and Graham Williams (Director of Data Science, Microsoft) Employee retention has been and will continue to be one of the biggest challenges of a company. While classical tactics such as promotion, competitive perks, etc. are practiced as ways to retain employees, it is now ... [Read more...]

The Flexibility of Remote and Local R Workspaces

January 4, 2017 | Guest Blogger

by Sean Wells, Senior Software Engineer, Microsoft The mrsdeploy R package facilitates Remote Execution and Web Service interactions from your local R IDE command line against a remote Microsoft R Server instance. Both core features can be used independently of one another or combined to support different convenient workflows. These ... [Read more...]

Parallelizing Data Analytics on Azure with the R Interface Tool

December 27, 2016 | Guest Blogger

by Le Zhang (Data Scientist, Microsoft) and Graham Williams (Director of Data Science, Microsoft) In data science, to develop a model with optimal performance, exploratory experiments on different sets of hyper-parameters are often performed. Preliminary analyses on smaller data can be performed on a single machine, while the experimental one ... [Read more...]

Calculating AUC: the area under a ROC Curve

November 22, 2016 | Guest Blogger

by Bob Horton, Microsoft Senior Data Scientist Receiver Operating Characteristic (ROC) curves are a popular way to visualize the tradeoffs between sensitivity and specificity in a binary classifier. In an earlier post, I described a simple “turtle’s eye view” of these plots: a classifier is used to sort cases ... [Read more...]

Data Manipulation with sparklyr on Azure HDInsight

November 8, 2016 | Guest Blogger

by Ali Zaidi, Data Scientist at Microsoft Apache Spark and a Tale of APIs: Spark is an exceptionally popular processing engine for distributed data. Dealing with data in distributed storage and programming with concurrent systems often requires learning complicated new paradigms and techniques. Statisticians and data scientists familiar with R ... [Read more...]

Sharing our R Programs — With Style

October 25, 2016 | Guest Blogger

by Graham Williams, Director of Data Science, Microsoft Programming is an art and a way we express ourselves. As we write our programs we should keep in mind that someone else is very likely to be reading them. We can facilitate the accessibility of our programs through a clear presentation ... [Read more...]

Estimating the value of a vehicle with R

October 18, 2016 | Guest Blogger

by Srini Kumar, Director of Data Science at Microsoft We tend to think of R and other such ML tools only in the context of the workplace, to do “weighty” things aimed at saving millions. A little judicious use of R may help us hugely in our personal lives too. ... [Read more...]

How to choose the right tool for your data science project

September 22, 2016 | Guest Blogger

by Brandon Rohrer, Principal Data Scientist, Microsoft R or Python? Torch or TensorFlow? (or MXNet or CNTK)? Spark or map-reduce? When we're getting started on a project, the mountain of tools to choose from can be overwhelming. Sometimes it makes me feel small and bewildered, like Alice in Wonderland. Luckily, ... [Read more...]

Introducing R for Big Data with PivotalR

June 4, 2014 | Guest Blogger

Wouldn't it be great if there was a way to harness the familiarity and usability of a tool like R, and at the same time take advantage of the performance and scalability benefits of in-database/in-Hadoop computation? We're happy to announce PivotalR, a package that translates R code into SQL ... [Read more...]
