Blog Archives

Using Microsoft R Server and dplyrxdf to Predict Flight Arrival Delays

July 5, 2016
By
Using Microsoft R Server and dplyrxdf  to Predict Flight Arrival Delays

by Konstantin Golyaev, Data Scientist at Microsoft I recently participated in an internal one-day Microsoft R Server (MRS) hackathon. For an experienced base R user but a complete MRS novice, this turned out to be an interesting challenge. R has fantastic and unparalleled set of tools for exploratory data analysis, as long as your data set is small enough...

Read more »

The useR! 2016 Tutorials

June 30, 2016
By

by Joseph Rickert Over the years I have seen several excellent tutorials at useR!conferences that were not only very satisfying "you had to be there" experiences but were also backed up with meticulously prepared materials of lasting value. This year, quite a few useR!20i6 tutorials measure up to this level of quality. My take on why things turned out...

Read more »

R, Stan and Bayesian Statistics

June 23, 2016
By
R, Stan and Bayesian Statistics

by Joseph Rickert Just about two and a half years ago I wrote about some resources for doing Bayesian statistics in R. Motivated by the tutorial Modern Bayesian Tools for Time Series Analysis by Harte and Weylandt that I attended at R/Finance last month, and the upcoming tutorial An Introduction to Bayesian Inference using R Interfaces to Stan that...

Read more »

Exploring Global Internet Performance Data Using R

June 21, 2016
By
Exploring Global Internet Performance Data Using R

by Lourdes O. Montenegro Lourdes O. Montenegro is a PhD candidate at the Lee Kuan Yew School of Public Policy, National University of Singapore. Her research interests cover the intersection of applied data science, technology, economics and public policy. Many of us now find it hard to live without a good quality internet connection. As a result, there is...

Read more »

The R Packages of UseR! 2016

June 16, 2016
By
The R Packages of UseR! 2016

by Joseph Rickert It is always a delight to discover a new and useful R package, and it is especially nice when the discovery comes with at context and testimonial to its effectiveness. It is also satisfying to be able to check in once in awhile and get an idea of what people think is hot, or current or...

Read more »

Using Microsoft R Server on a single machine for experiments with 600 million taxi rides.

June 14, 2016
By
Using Microsoft R Server on a single machine for experiments with 600 million taxi rides.

by Dmitry Pechyoni, Microsoft Data Scientist The New York City taxi dataset is one of the largest publicly available datasets. It has about 1.1 billion taxi rides in New York City. Previously this dataset was explored and visualized in a number of blog posts, where the authors used various technologies (e.g., PostgreSQL and Apache Elastic Search). Moreoever, in a...

Read more »

R Consortium and User! 2016 News

June 9, 2016
By

by Joseph Rickert IBM Joins the R Consortium This past Monday at the Spark Summit in San Francisco IBM announced that it had joined the R Consortium as a "Platinum" member. This is very good news with respect to the development and growth of the R language, the health of the R Community and the position of opensource software...

Read more »

Bayesian Optimization of Machine Learning Models

June 7, 2016
By
Bayesian Optimization of Machine Learning Models

by Max Kuhn: Director, Nonclinical Statistics, Pfizer Many predictive and machine learning models have structural or tuning parameters that cannot be directly estimated from the data. For example, when using K-nearest neighbor model, there is no analytical estimator for K (the number of neighbors). Typically, resampling is used to get good performance estimates of the model for a given...

Read more »

Using caret to compare models

June 2, 2016
By
Using caret to compare models

by Joseph Rickert The model table on the caret package website lists more that 200 variations of predictive analytics models that are available withing the caret framework. All of these models may be prepared, tuned, fit and evaluated with a common set of caret functions. All on its own, the table is an impressive testament to the utility and...

Read more »

Principal Components Regression in R: Part 3

May 31, 2016
By
Principal Components Regression in R: Part 3

by John Mount Ph. D. Data Scientist at Win-Vector LLC In her series on principal components analysis for regression in R, Win-Vector LLC's Dr. Nina Zumel broke the demonstration down into the following pieces: Part 1: the proper preparation of data and use of principal components analysis (particularly for supervised learning or regression). Part 2: the introduction of y-aware...

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)