Blog Archives

fluent-r: a new R analytics integration library for JVM developers

November 10, 2015
By
fluent-r: a new R analytics integration library for JVM developers

by David Russell, fluent-r developer fluent-r is a new R analytics integration library for JVM application developers that improves upon existing solutions for integrating R analytics services delivered by popular open source R integration servers DeployR and OpenCPU. The fluent-r library provides a natural-language DSL alongside a simple API that can be used to replace or complement existing use...

Read more »

Accessing Bitcoin Data with R

November 4, 2015
By

by Joseph Rickert I am not yet a Bitcoin advocate. Nevertheless, I am impressed with the amount of Bitcoin activity and the progress that advocates are making towards having Bitcoin recognized as a legitimate currency. Right now, I am mostly interested in the technology behind bitcoin and the possibility of working with some interesting data sets. A good bit...

Read more »

Differential Privacy Mini-series from Win-Vector

November 3, 2015
By
Differential Privacy Mini-series from Win-Vector

by Nina Zumel Principal Consultant Win-Vector LLC We've just finished off a series of articles on some recent research results applying differential privacy to improve machine learning. Some of these results are pretty technical, so we thought it was worth working through concrete examples. And some of the original results are locked behind academic journal paywalls, so we've tried...

Read more »

Instrumental Variables

October 29, 2015
By
Instrumental Variables

by Joseph Rickert We all "know" that correlation does not imply causation, that unmeasured and unknown factors can confound a seemingly obvious inference. But, who has not been tempted by the seductive quality of strong correlations? Fortunately, it is also well known that a well done randomized experiment can account for the unknown confounders and permit valid causal inferences....

Read more »

Party with the First Tribe

October 22, 2015
By
Party with the First Tribe

by Joseph Rickert In a recent previous post, I wrote about support vector machines, the representative master algorithm of the 5th tribe of machine learning practitioners described by Pedro Domingos in his book, The Master Algorithm. Here we look into algorithms favored by the first tribe, the symbolists, who see learning as the process of inverse deduction. Pedro writes:...

Read more »

The 5th Tribe, Support Vector Machines and caret

October 15, 2015
By
The 5th Tribe, Support Vector Machines and caret

by Joseph Rickert In his new book, The Master Algorithm, Pedro Domingos takes on the heroic task of explaining machine learning to a wide audience and classifies machine learning practitioners into 5 tribes*, each with its own fundamental approach to learning problems. To the 5th tribe, the analogizers, Pedro ascribes the Support Vector Machine (SVM) as it's master algorithm....

Read more »

Using miniCRAN in Azure ML

October 13, 2015
By
Using miniCRAN in Azure ML

by Michele Usuelli Microsoft Data Scientist Azure Machine Learning Studio is a drag-and-drop tool to deploy data-driven solutions. It contains pre-built items including data preparation tools and Machine Learning algorithms. In addition, it allows to include R and Python custom scripts. In order to build powerful R tools, you might want to use some packages from the CRAN repository....

Read more »

Learning R: Index of Online R Courses, October 2015

October 8, 2015
By
Learning R: Index of Online R Courses, October 2015

by Joseph Rickert Early October: somewhere the leaves are turning brilliant colors, temperatures are cooling down and that back to school feeling is in the air. And for more people than ever before, it is going to seem to be a good time to commit to really learning R. I have some suggestions for R courses below, but first:...

Read more »

R User Groups Highlight R Creativity

October 1, 2015
By
R User Groups Highlight R Creativity

by Joseph Rickert I have been a big fan of R user groups since I attended my first meeting. There is just something about the vibe of being around people excited about what they are doing that feels good. From a speaker's perspective, presenting at an R user Group meeting must be the rough equivalent of doing "stand-up" at...

Read more »

Why Big Data? Learning Curves

September 29, 2015
By
Why Big Data? Learning Curves

by Bob Horton Microsoft Senior Data Scientist Learning curves are an elaboration of the idea of validating a model on a test set, and have been widely popularized by Andrew Ng’s Machine Learning course on Coursera. Here I present a simple simulation that illustrates this idea. Imagine you use a sample of your data to train a model, then...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)