Articles by R on Datentrang

New plot functionality for ClustImpute 0.2.0 and other improvements

April 1, 2021 | R on Datentrang

Let’s create some dummy data… ### Random Dataset set.seed(739) n [Read more...]

Interactive Individual Conditional Expectation (ICE) plots

April 1, 2021 | R on Datentrang

This post is not about a new technique or package, but rather combining existing functionality in interpretable machine learning and data visualization in a way to facilitate analyses of model results. We’ll make use of two packages DALEX and PLOTLY ot... [Read more...]

Developing an R package from scratch with Travis continuous integration

July 20, 2019 | R on Datentrang

This short tutorial provdes a quick guide on how to develop an R package from scratch and how use Travis CI for automatic builds on various R versions and automatic test coverage calculation. The resulting package can be found here: CIexamplePkg A very nice general introduction can be found here: ... [Read more...]

Measuring feature importance in k-means clustering and variants thereof

July 9, 2019 | R on Datentrang

We present a novel approach for measuring feature importance in k-means clustering, or variants thereof, to increase the interpretability of clustering results. In supervised machine learning, feature importance is a widely used tool to ensure interpretability of complex models. We adapt this idea to unsupervised learning via partitional clustering. Our ...

[Read more...]

Benchmarking missing data strategies for k-means clustering

June 30, 2019 | R on Datentrang

The goal is to compare a few algorithms for missing imputation when used before k-means clustering is performed. For the latter we use the same algorithm as in ClustImpute to ensure that only the computation time of the imputation is compared. In a nutshell, we’ll se that ClustImpute scales ...

[Read more...]

Intoducing ClustImpute: A new approach for k-means clustering with build-in missing data imputation

June 19, 2019 | R on Datentrang

We are happily introducing a new k-means clustering algorithm that includes a powerful multiple missing data imputation at the computational cost of a few extra random imputations (benchmarks following in a separate article). More precisely, the algorithm draws the missing values iteratively based on the current cluster assignment so that ...

[Read more...]

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

Articles by R on Datentrang

New plot functionality for ClustImpute 0.2.0 and other improvements

Interactive Individual Conditional Expectation (ICE) plots

Developing an R package from scratch with Travis continuous integration

Measuring feature importance in k-means clustering and variants thereof

Benchmarking missing data strategies for k-means clustering

Intoducing ClustImpute: A new approach for k-means clustering with build-in missing data imputation

Articles by R on Datentrang

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)