Blog Archives

Coding Random Forests in 100 lines of code*

June 5, 2019

In our series explaining methods in 100 lines of code, we tackle random forests this time! We build one from scratch and explore its functions. The post Coding Random Forests in 100 lines of code* first appeared on STATWORX.

Coding Regression trees in 150 lines of R code

November 9, 2018

Motivation: There are dozens of machine learning algorithms out there. It is impossible to learn all their mechanics; however, many algorithms sprout from the most established ones, e.g. ordinary least squares, gradient boosting, support vector machines, tree-based algorithms and neural networks. At STATWORX we discuss algorithms daily to evaluate their usefulness for a specific project. In any case, understanding...

Coding Gradient boosted machines in 100 lines of code

November 5, 2018

Motivation: There are dozens of machine learning algorithms out there. It is impossible to learn all their mechanics; however, many algorithms sprout from the most established ones, e.g. ordinary least squares, gradient boosting, support vector machines, tree-based algorithms and neural networks. At STATWORX we discuss algorithms daily to evaluate their usefulness for a specific project or problem. In any...

Benchmarking Feature Selection Algorithms with Xy()

July 20, 2018

Feature Selection: Feature selection is one of the most interesting fields in machine learning in my opinion. It is a boundary point of two different perspectives on machine learning – performance and inference. From a performance point of view, feature selection is typically used to increase the model performance or to reduce the complexity of the problem in order...

Pushing Ordinary Least Squares to the limit with Xy()

July 2, 2018

Introduction to Xy(): Simulation is mostly about answering particular research questions. Whenever the word simulation appears somewhere in a discussion, everyone knows that this means additional effort. At STATWORX we use simulations as a first step to prove concepts we are developing. Sometimes such a simulation is simple; in other cases a simulation is plenty of work. Though,...

burglr – stealing code from the web

April 13, 2018

Introduction: All we do at STATWORX all day long is steal code from the web. That is why I thought it would only be fair to code a function which does that conveniently. With burglr you have all functions and kickass machine learning models at your fingertips. This would have been a more exciting description of the function I...

Dive – the debugging function you deserve

March 19, 2018

A programmer’s misery: Writing R functions is always fun up until the point where you have to debug them, especially if you are layering functions. In the best case, you can just use the native debug functionalities of R. However, in my opinion this is only feasible when you code in RStudio, because you can use the interface to...
