Blog Archives

Book Review: Graphical Data Analysis with R

April 7, 2016
By
Book Review: Graphical Data Analysis with R

by Joseph Rickert Basically, there are two kinds of graphics or plots you can make from a data set: (1) those that allow you to see what is going on with the data, and (2) those you make to communicate what you have found to someone else. When making the first kind, you want to select plots that will...

Read more »

An Analysis of Traffic Violation Data with SQL Server and R

April 6, 2016
By
An Analysis of Traffic Violation Data with SQL Server and R

By Srini Kumar, Director of Data Science at Microsoft Who does not hate being stopped and given a traffic ticket? Invariably, we think that something is not fair that we got it and everyone else did not. I am no different, and living in the SF Bay Area, I have often wondered if I could get the data about...

Read more »

What’s new on CRAN: March 2016

March 31, 2016
By
What’s new on CRAN: March 2016

by Joseph Rickert Packages continue to flood into CRAN at a rate the challenges the sanity of anyone trying to keep up with what's new. So far this month, more than 190 packages have been added. Here is a my view of what's interesting in this March madness. The launch_tutorial() function from the RtutoR package by Anup Nair launches...

Read more »

Learning from Learning Curves

March 29, 2016
By
Learning from Learning Curves

by Bob Horton, Senior Data Scientist, Microsoft This is a follow-up to my earlier post on learning curves. A learning curve is a plot of predictive error for training and validation sets over a range of training set sizes. Here we’re using simulated data to explore some fundamental relationships between training set size, model complexity, and prediction error. Start...

Read more »

Get ready for the New York R Conference

March 24, 2016
By
Get ready for the New York R Conference

by Joseph Rickert Last year, I wrote the New York R Conference “set the bar pretty darn high for a first time conference”. Not only was there an outstanding lineup of speakers, but the energy and enthusiasm that conference attendees brought with them, or maybe just generated on the spot, was remarkable. This year I am definitely looking forward...

Read more »

Scoring R Models with Excel

March 17, 2016
By
Scoring R Models with Excel

by Joseph Rickert In a post late last year, my colleague and fellow blogger, Andrie de Vries described enhancements to the AzureML R package that makes it easy to publish R functions that consume data frames as Azure Web Services. A very nice consequence is that it is now feasible to develop predictive models in R and enable the...

Read more »

Computing Classification Evaluation Metrics in R

March 11, 2016
By

by Said Bleik, Shaheen Gauher, Data Scientists at Microsoft Evaluation metrics are the key to understanding how your classification model performs when applied to a test dataset. In what follows, we present a tutorial on how to compute common metrics that are often used in evaluation, in addition to metrics generated from random classifiers, which help in justifying the...

Read more »

Bay Area R User Group at Strata and PAW

March 10, 2016
By

by Joseph Rickert I always think of Strata Hadoop World and Predictive Analytics World as initiating the Spring conference season here in the San Francisco Bay Area. The rainy season is usually over by the end of March and it is a perfect time to visit. If you are traveling to either of these conferences from out of town...

Read more »

Confidence Intervals for Random Forests

March 3, 2016
By
Confidence Intervals for Random Forests

by Joseph Rickert Random Forests, the "go to" classifier for many data scientists, is a fairly complex algorithm with many moving parts that introduces randomness at different levels. Understanding exactly how the algorithm operates requires some work, and assessing how good a Random Forests model fits the data is a serious challenge. In the pragmatic world of machine learning...

Read more »

Analysing the movements of a cat

March 1, 2016
By
Analysing the movements of a cat

by Verena Haunschmid Since I have a cat tracker, I wanted to do some analysis of the behavior of my cats. I have shown how to do some of these things here. Data Collection The data was collected using the Tractive GPS Pet Tracker over a period of about one year from January 2014 to November 2014 (with breaks)....

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series











Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)