Blog Archives

Computing Classification Evaluation Metrics in R

March 11, 2016

by Said Bleik and Shaheen Gauher, Data Scientists at Microsoft

Evaluation metrics are the key to understanding how your classification model performs when applied to a test dataset. In what follows, we present a tutorial on how to compute common metrics that are often used in evaluation, in addition to metrics generated from random classifiers, which help in justifying the...

Bay Area R User Group at Strata and PAW

March 10, 2016

by Joseph Rickert

I always think of Strata Hadoop World and Predictive Analytics World as initiating the Spring conference season here in the San Francisco Bay Area. The rainy season is usually over by the end of March and it is a perfect time to visit. If you are traveling to either of these conferences from out of town...

Confidence Intervals for Random Forests

March 3, 2016

by Joseph Rickert

Random Forests, the "go to" classifier for many data scientists, is a fairly complex algorithm with many moving parts that introduces randomness at different levels. Understanding exactly how the algorithm operates requires some work, and assessing how well a Random Forests model fits the data is a serious challenge. In the pragmatic world of machine learning...

Analysing the movements of a cat

March 1, 2016

by Verena Haunschmid

Since I have a cat tracker, I wanted to do some analysis of the behavior of my cats. I have shown how to do some of these things here.

Data Collection

The data was collected using the Tractive GPS Pet Tracker over a period of about one year from January 2014 to November 2014 (with breaks)....

Generating and Visualizing Multivariate Data with R

February 25, 2016

by Joseph Rickert

The ability to generate synthetic data with a specified correlation structure is essential to modeling work. As you might expect, R’s toolbox of packages and functions for generating and visualizing data from multivariate distributions is impressive. The basic function for generating multivariate normal data is mvrnorm() from the MASS package, which ships with the standard R distribution, although the...

Analysis of Xbox Usage Trends with R: Time Series Decompositions

February 23, 2016

by Daniel Moore, Director of Applied Statistics Engineering, Console Development, Microsoft

In Xbox Hardware, we are interested in the various ways that our hardware is used, and we are especially interested in how that usage changes over time. We employ several time series analysis techniques that are helpful in getting a holistic view of usage of the Xbox...

Bay Area useR Group Lightning Talks

February 18, 2016

by Joseph Rickert

Earlier this month the Bay Area useR Group (BARUG) held its annual lightning talk meeting. This is by far our most popular meeting format: eight 15-minute talks (12 minutes speaking and 3 minutes Q & A while the next speaker is setting up) packed into a two-hour time slot. The intensity seems to really...

More R User Group Sites

February 16, 2016

by Joseph Rickert

Last month I wrote about how several R user groups were making use of GitHub and listed some sites that I thought had interesting material. A few readers were kind enough to point out sites that I had missed, so I would just like to give a couple of "shout outs" here. First of all, I...

Getting Started with Microsoft SQL Server on Azure

February 11, 2016

by Joseph Rickert

If you are an R user and work for an enterprise where Microsoft SQL Server is important, it is extremely helpful to have easy access to SQL Server databases. Over the past year, we have run several posts on this topic including a comprehensive four-part series from Microsoft's Gregory Vandenbrouck on using various flavors of...

R User Groups in Poland

February 9, 2016

by Przemyslaw Biecek

The first meeting of R users in Poland took place in Wroclaw in 2008. It was a one-day conference with 27 participants and 6 talks. Today, we have three large groups of R users in major Polish cities (according to meetup.com there are 640 users in SER - Warsaw, 235 in eRka - Cracow and 64...
