What has Kaggle learned from 2 million machine learning models?

February 5, 2016
By

What has Kaggle learned from 2 million machine learning models? Anthony Goldbloom, founder and CEO...

Read more »

Pitfall of XML package: to know the cause

February 5, 2016
By
Pitfall of XML package:  to know the cause

This is the sequel to the previous report “issues specific to cp932 locale, Japanese Shift-JIS, on Windows“.  In this report, I will dig the issues deeper to find out...

Read more »

Speaking at DataPhilly February 2016

February 5, 2016
By
Speaking at DataPhilly February 2016

The next DataPhilly meetup will feature a medley of machine-learning talks, including an Intro to ML from yours truly. Check out the speakers list and be sure to RSVP....

Read more »

Introducing Microsoft R Open: Replay and slides

February 5, 2016
By

We had a fantastic turnout to last week's webinar, Introduction to Microsoft R Open. If you missed it, you can watch the replay below. In the talk, I gives...

Read more »

Shiny Developer Conference 2016 Recap

February 5, 2016
By

This is a guest post from VP Nagraj, a data scientist embedded within UVA’s Health Sciences Library, who runs our

Read more »

Cricket analytics with cricketr in paperback and Kindle versions

February 5, 2016
By
Cricket analytics with cricketr in paperback and Kindle versions

My book “Cricket analytics with cricketr” is now available in paperback and Kindle versions. The paperback is available from Amazon (US, UK and Europe) for $ 48.99. The Kindle...

Read more »

New Version of “Wrangling F1 Data With R” Just Released…

February 5, 2016
By
New Version of “Wrangling F1 Data With R” Just Released…

So I finally got round to pushing a revised (and typo corrected!) version of Wrangling F1 Data With R: A Data Junkie’s Guide, that also includes a handful of...

Read more »

Data from the World Health Organization API

February 5, 2016
By
Data from the World Health Organization API

Eric Persson released yesterday a new WHO R package which allows easy access to the World Health Organization’s data API. He’s also done a nice...

Read more »

Alternate R Markdown Templates

February 4, 2016
By

The knitr/R markdown system is a great way to organize reports and analyses. However, the built-in ones (that come with RStudio/the rmarkdown package) rely on Bootstrap and also use...

Read more »

Death Comes to Us All

February 4, 2016
By
Death Comes to Us All

I have been working with a data set on causes of death in my adopted home state of Utah for a little while now, and I had been struggling...

Read more »

OpenCPU Server Release 1.5.4

February 4, 2016
By
OpenCPU Server Release 1.5.4

Version 1.5.4 of the OpenCPU server has been released to Launchpad (Ubuntu) and OBS (Fedora). This update does not introduce...

Read more »

Free video course: applied Bayesian A/B testing in R

February 4, 2016
By
Free  video course: applied Bayesian A/B testing in R

As a “thank you” to our blog, mailing list, and Twitter followers (@WinVectorLLC) we at Win-Vector LLC have decided to re-release our formerly fee-based A/B testing video course as...

Read more »

Weekly R-Tips: Visualizing Predictions

February 4, 2016
By
Weekly R-Tips: Visualizing Predictions

Lets say that we estimated a linear regression model on time series data with lagged predictors. The goal is to estimate sales as a function of inventory, search volume,...

Read more »

Predicting wine quality using Random Forests

February 4, 2016
By
Predicting wine quality using Random Forests

Hello everyone! In this article I will show you how to run the random forest algorithm in R. We will use the wine quality data set (white) from the...

Read more »

Using Microsoft R Open with RStudio

February 4, 2016
By
Using Microsoft R Open with RStudio

by Joseph Rickert A frequent question that we get here at Microsoft about MRO (Microsoft R Open) is: can be used with RStudio? The short answer is absolutely yes!...

Read more »

The R-Podcast Episode 17: A Simply Radiant Chat with Vincent Nijs

February 3, 2016
By

The R-Podcast continues its series on Shiny and the first-ever Shiny Developer Conference by catching up with Vincent Nijs, associate professor of marketing at UC San Diego and one...

Read more »

optimal simulation on a convex set

February 3, 2016
By
optimal simulation on a convex set

This morning, we had a jam session at the maths department of Paris-Dauphine where a few researchers & colleagues of mine presented their field of research to the whole...

Read more »

Simple Distributions for Mixtures?

February 3, 2016
By
Simple Distributions for Mixtures?

The idea of GLMs is that given some covariates,  has a distribution in the exponential family (Gaussian, Poisson, Gamma, etc). But that does not mean that  has a similar distribution… so...

Read more »

Mapping the world’s longest plane fights

February 3, 2016
By
Mapping the world’s longest plane fights

If you're one of those people that dreads long plane flights, this map by Matt Strimas-Mackey will help you find routes to avoid. It shows Wikipedia's list of the...

Read more »

When k-means Clustering Fails

February 2, 2016
By
When k-means Clustering Fails

This entry is part 19 of 19 in the series Using RLetting the computer automatically find groupings in data is incredibly powerful and is at the heart of “data...

Read more »

Commonmark: Super Fast Markdown Rendering in R

February 2, 2016
By
Commonmark: Super Fast Markdown Rendering in R

A few months ago I first announced the commonmark R package. Since then there have been a few more releases… time for...

Read more »

Unemployment in Europe

February 2, 2016
By
Unemployment in Europe

A couple of years I have made plots of unemployment and its change over the years. At first this was a bigger and complex piece of code. As things...

Read more »

memoise 1.0.0

February 2, 2016
By
memoise 1.0.0

We are pleased to announce version 1.0.0 of the memoise package is now available on CRAN. Memoization stores the value of function call and returns the cached result when...

Read more »

tidyr 0.4.0

February 2, 2016
By
tidyr 0.4.0

I’m pleased to announce tidyr 0.4.0. tidyr makes it easy to “tidy” your data, storing it in a consistent form so that it’s easy to manipulate, visualise and model....

Read more »

httr 1.1.0 (and 1.0.0)

February 2, 2016
By
httr 1.1.0 (and 1.0.0)

httr 1.1.0 is now available on CRAN. The httr packages makes it easy to talk to web APIs from R. Learn more in the quick start vignette. Install the...

Read more »

7 Ways to Perplex a Data Scientist

February 2, 2016
By
7 Ways to Perplex a Data Scientist

On the heels of a report showing the inefficacy of government-run cyber security, it’s imperative to understand the limitations of …Continue reading →

Read more »

Devtools 1.10.0

February 2, 2016
By
Devtools 1.10.0

Devtools 1.10.0 is now available on CRAN. Devtools makes package building so easy that a package can become your default way to organise code, data, documentation, and tests. You...

Read more »

2015 in review and a preview of 2016

February 2, 2016
By

DataCamp’s mission is to build the best online platform for data science education with a focus on R and Python. In this post, we share our journey during...

Read more »

Like peanut butter and jelly: x13binary and seasonal

February 2, 2016
By

This post was written by Dirk Eddelbuettel and Christoph Sax and will be posted on both author's respective blogs. The seasonal package by Christoph Sax brings...

Read more »

Sponsors