September, 2017 | R-bloggers

Going, Going . . . 1

September 28, 2017 | jameshunterbr

Two posts today with similar themes. Time is running out. First, time is running out for Giancarlo Stanton. His bat has been very silent this week so far. The Marlins have 7 more games and he still needs 4 dingers … Continue reading → [Read more...]

Partial Pooling for Lower Variance Variable Encoding

September 28, 2017 | Nina Zumel

Banaue rice terraces. Photo: Jon Rawlinson In a previous article, we showed the use of partial pooling, or hierarchical/multilevel models, for level coding high-cardinality categorical variables in vtreat. In this article, we will discuss a little more about the how and why of partial pooling in R. We will ...

[Read more...]

How Good is That Random Number Generator?

September 28, 2017 | Dave Giles

Recently, I saw a reference to an interesting piece from 2013 by Peter Grogono, a computer scientist now retired from Concordia University. It's to do with checking the "quality" of a (pseudo-) random number generator. Specifically, Peter discusses what he calls "The Pickover Test". This refers to the following suggestion that ...

[Read more...]

Goodness of Fit in MDS and t-SNE with Shepard Diagrams

September 28, 2017 | Jake Hoare

The goodness of fit for data reduction techniques such as MDS and t-SNE can be easily assessed with Shepard diagrams. A Shepard diagram compares how far apart your data points are before and after you transform... [Read more...]

R 3.4.2 is released

September 28, 2017 | David Smith

The R Core team today announced the release of R 3.4.2. This release fixes a number of minor bugs and also includes a performance improvement to the commonly-used function c when applied to vectors with a names attribute. Like all minor releases, this release is backwards compatible with prior releases in ... [Read more...]

Gold-Mining – Week 4 (2017)

September 28, 2017 | Michael Griebe

Week 4 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining – Week 4 (2017) appeared first on Fantasy Football Analytics. [Read more...]

SODD — StackOverflow Driven-Development

September 28, 2017 | hrbrmstr

I occasionally hang out on StackOverflow and often use an answer as an opportunity to fill a package void for a particular need. docxtractr and qrencoder are two (of many) packages that were birthed from SO answers. I usually try to answer with inline code first then expand the functionality ... [Read more...]

Oneway ANOVA Explanation and Example in R; Part 2

September 28, 2017 | Chuck Powell

Please read the first part published at DataScience+, if you haven’t. Effect sizes and the strength of our prediction One relatively common question in statistics or data science is, how “big” is the difference or the effect? At this point we can state with some statistical confidence that tire ...

[Read more...]

Marketing Multi-Channel Attribution model based on Sales Funnel with R

September 27, 2017 | Sergey

This is the last post in the series of articles about using Multi-Channel Attribution in marketing. In previous two articles (part 1 and part 2), we’ve reviewed a simple and powerful approach based on Markov chains that allows you to effectively attribute marketing channels. This is the last post in the ...

[Read more...]

Marketing Multi-Channel Attribution model based on Sales Funnel with R

September 27, 2017 | Sergey Bryl' (Analyzecore.com blog)

This is the last post in the series of articles about using Multi-Channel Attribution in marketing. In previous two articles (part 1 and part 2), we’ve reviewed a simple and powerful approach based on Markov chains that allows you to effectively attribute marketing channels. In this article, we will review another ...

[Read more...]

RcppZiggurat 0.1.4

September 27, 2017 | Thinking inside the box

A maintenance release of RcppZiggurat is now on the CRAN network for R. It switched the vignette to the our new pinp package and its two-column pdf default. The RcppZiggurat package updates the code for the Ziggurat generator which provides very fas... [Read more...]

???? Dortmund real estate market analysis: tree-based methods

September 27, 2017 | Iegor Rudnytskyi

In pervious posts traditional regression models were fitted to real estate data. In this post tree-based models, namely random forests and gradient boosting, are trained to predict prices of the rent. These methods typically outperform traditional regression models yielding smaller errors. Furthermore, tree-based methods are much more robust to overfitting, ...

[Read more...]

Blockchain & distributed ML – my report from the data2day conference

September 27, 2017 | Dr. Shirin Glander

Yesterday and today I attended the data2day, a conference about Big Data, Machine Learning and Data Science in Heidelberg, Germany. Topics and workshops covered a range of topics surrounding (big) data analysis and Machine Learning, like Deep Learn...

[Read more...]

Blockchain & distributed ML – my report from the data2day conference

September 27, 2017 | Shirin's playgRound

Yesterday and today I attended the data2day, a conference about Big Data, Machine Learning and Data Science in Heidelberg, Germany. Topics and workshops covered a range of topics surrounding (big) data analysis and Machine Learning, like Deep Learnin...

[Read more...]

CACE closed: EM opens up exclusion restriction (among other things)

September 27, 2017 | Keith Goldfeld

This is the third, and probably last, of a series of posts touching on the estimation of complier average causal effects (CACE) and latent variable modeling techniques using an expectation-maximization (EM) algorithm . What follows is a simplistic way to implement an EM algorithm in R to do principal strata estimation ...

[Read more...]

Featurizing images: the shallow end of deep learning

September 27, 2017 | David Smith

by Bob Horton and Vanja Paunic, Microsoft AI and Research Data Group Training deep learning models from scratch requires large data sets and significant computational reources. Using pre-trained deep neural network models to extract relevant features from images allows us to build classifiers using standard machine learning approaches that work ... [Read more...]

New Course! Supervised Learning in R: Classification

September 27, 2017 | DataCamp Blog

Hi there! We proud to launch our latest R & machine learning course, Supervised Learning in R: Classification! By Brett Lantz. This beginner-level introduction to machine learning covers four of the most common classification algorithms. You will ... [Read more...]

Oneway ANOVA Explanation and Example in R; Part 1

September 27, 2017 | Chuck Powell

This tutorial was inspired by a this post published at DataScience+ by Bidyut Ghosh. Special thanks also to Dani Navarro, The University of New South Wales (Sydney) for the book Learning Statistics with R (hereafter simply LSR) and the lsr packages available through CRAN. I highly recommend it. Let’s ...

[Read more...]

Abstract Data Types and the Uniform Referent Principle I: why Douglas T. Ross would hate nest(), unnest(), gather() and spread()

September 27, 2017 | Jocelyn Ireson-Paine

“What’s the Uniform Referent Principle?” my colleague asked me on reading my last post. I think I first came across it in Jean Sammet’s famous book Programming Languages: History and Fundamentals. In a description of Douglas Ross’s AED-0 language, she pointed out a feature that she thought ... [Read more...]

Churn Prediction with Automatic ML

September 27, 2017 | Dominik Krzemiński

Sometimes we don’t even realize how common machine learning (ML) is in our daily lives. Various “intelligent” algorithms help us for instance with finding the most important facts (Google), they suggest what movie to watch (Netflix), or influence our shopping decisions (Amazon). The biggest international companies quickly recognized the ...

[Read more...]

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

September 2017

Going, Going . . . 1

Partial Pooling for Lower Variance Variable Encoding

How Good is That Random Number Generator?

Goodness of Fit in MDS and t-SNE with Shepard Diagrams

R 3.4.2 is released

Gold-Mining – Week 4 (2017)

SODD — StackOverflow Driven-Development

Oneway ANOVA Explanation and Example in R; Part 2

Marketing Multi-Channel Attribution model based on Sales Funnel with R

Marketing Multi-Channel Attribution model based on Sales Funnel with R

RcppZiggurat 0.1.4

???? Dortmund real estate market analysis: tree-based methods

Blockchain & distributed ML – my report from the data2day conference

Blockchain & distributed ML – my report from the data2day conference

CACE closed: EM opens up exclusion restriction (among other things)

Featurizing images: the shallow end of deep learning

New Course! Supervised Learning in R: Classification

Oneway ANOVA Explanation and Example in R; Part 1

Abstract Data Types and the Uniform Referent Principle I: why Douglas T. Ross would hate nest(), unnest(), gather() and spread()

Churn Prediction with Automatic ML

September 2017

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)