September 2017

Going, Going . . . 1

September 28, 2017 | jameshunterbr

Two posts today with similar themes. Time is running out.  First, time is running out for Giancarlo Stanton. His bat has been very silent this week so far. The Marlins have 7 more games and he still needs 4 dingers … Continue reading → [Read more...]

Partial Pooling for Lower Variance Variable Encoding

September 28, 2017 | Nina Zumel

Banaue rice terraces. Photo: Jon Rawlinson In a previous article, we showed the use of partial pooling, or hierarchical/multilevel models, for level coding high-cardinality categorical variables in vtreat. In this article, we will discuss a little more about the how and why of partial pooling in R. We will ...
[Read more...]

How Good is That Random Number Generator?

September 28, 2017 | Dave Giles

Recently, I saw a reference to an interesting piece from 2013 by Peter Grogono, a computer scientist now retired from Concordia University. It's to do with checking the "quality" of a (pseudo-) random number generator. Specifically, Peter discusses what he calls "The Pickover Test". This refers to the following suggestion that ...
[Read more...]

R 3.4.2 is released

September 28, 2017 | David Smith

The R Core team today announced the release of R 3.4.2. This release fixes a number of minor bugs and also includes a performance improvement to the commonly-used function c when applied to vectors with a names attribute. Like all minor releases, this release is backwards compatible with prior releases in ... [Read more...]

SODD — StackOverflow Driven-Development

September 28, 2017 | hrbrmstr

I occasionally hang out on StackOverflow and often use an answer as an opportunity to fill a package void for a particular need. docxtractr and qrencoder are two (of many) packages that were birthed from SO answers. I usually try to answer with inline code first then expand the functionality ... [Read more...]

Oneway ANOVA Explanation and Example in R; Part 2

September 28, 2017 | Chuck Powell

Please read the first part published at DataScience+, if you haven’t. Effect sizes and the strength of our prediction One relatively common question in statistics or data science is, how “big” is the difference or the effect? At this point we can state with some statistical confidence that tire ...
[Read more...]

RcppZiggurat 0.1.4

September 27, 2017 | Thinking inside the box

A maintenance release of RcppZiggurat is now on the CRAN network for R. It switched the vignette to the our new pinp package and its two-column pdf default. The RcppZiggurat package updates the code for the Ziggurat generator which provides very fas... [Read more...]

???? Dortmund real estate market analysis: tree-based methods

September 27, 2017 | Iegor Rudnytskyi

In pervious posts traditional regression models were fitted to real estate data. In this post tree-based models, namely random forests and gradient boosting, are trained to predict prices of the rent. These methods typically outperform traditional regression models yielding smaller errors. Furthermore, tree-based methods are much more robust to overfitting, ...
[Read more...]

Featurizing images: the shallow end of deep learning

September 27, 2017 | David Smith

by Bob Horton and Vanja Paunic, Microsoft AI and Research Data Group Training deep learning models from scratch requires large data sets and significant computational reources. Using pre-trained deep neural network models to extract relevant features from images allows us to build classifiers using standard machine learning approaches that work ... [Read more...]

Oneway ANOVA Explanation and Example in R; Part 1

September 27, 2017 | Chuck Powell

This tutorial was inspired by a this post published at DataScience+ by Bidyut Ghosh. Special thanks also to Dani Navarro, The University of New South Wales (Sydney) for the book Learning Statistics with R (hereafter simply LSR) and the lsr packages available through CRAN. I highly recommend it. Let’s ...
[Read more...]

Churn Prediction with Automatic ML

September 27, 2017 | Dominik Krzemiński

Sometimes we don’t even realize how common machine learning (ML) is in our daily lives. Various “intelligent” algorithms help us for instance with finding the most important facts (Google), they suggest what movie to watch (Netflix), or influence our shopping decisions (Amazon). The biggest international companies quickly recognized the ...
[Read more...]
1 2 3 4 15

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)