Articles by That’s so Random

Binning Outliers in a Histogram

April 26, 2017 | That’s so Random

I guess we all use it, the good old histogram. One of the first things we are taught in Introduction to Statistics and routinely applied whenever coming across a new continuous variable. However, it easily gets messed up by outliers. Putting most of the data into a single bin or ...

[Read more...]

Preparing Datetime Data for Analysis with padr and dplyr

March 19, 2017 | That’s so Random

Two months ago padr was introduced, followed by an improved version that allowed for applying pad on group level. See the introduction blogs or the vignette("padr") for more package information. In this blog I give four more elaborate examples on how to go from raw data to insight with ...

[Read more...]

Tree-based univariate testing

February 26, 2017 | That’s so Random

When building a predictive model it is a good idea to do a univariate analysis, before throwing the whole bunch in a complex algorithm. This way we get a feel for the potential contribution of each predictor. When a lot of predictors are available one can often make a first ...

[Read more...]

padr::pad does now do group padding

February 18, 2017 | That’s so Random

A few weeks ago padr was introduced on CRAN, allowing you to quickly get datetime data ready for analysis. If you have missed this, see the introduction blog or vignette("padr") for a general introduction. In v0.2.0 the pad function is extended with a group argument, which makes your life ... [Read more...]

A wrapper around nested ifelse

February 7, 2017 | That’s so Random

The ifelse function is the way to do vectorised if then else in R. One of the first cool things I learned to do in R a few years back, I got from Norman Matloff’s The Art of R Programming. When you have more than one if then statements, ... [Read more...]

Introducing padr

January 17, 2017 | That’s so Random

I am happy to introduce the padr package, which is now available on CRAN. If you frequently work with data containing a timestamp, especially automatically created data, you might find this package helpful. It solves two problems that you can be confronted with when preparing datetime data for analysis. First, ...

[Read more...]

Building a column selecter

November 27, 2016 | That’s so Random

Maybe the following sounds familiar. You have a large data set with many, many columns of which the most are irrelevant to you. Typically, a dump from a database or the full set extracted from an API. Several times I found myself the better part of an afternoon going back ...

[Read more...]

Designing our bathroom with R

August 10, 2016 | That’s so Random

R has been an indispensable tool since I started working with it about five years ago. Of course in my day job as a data scientist I couldn’t live without it, but it also proved to be a great aid in private life. Recently we bought our first house ...

[Read more...]

« 1 2

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

Articles by That’s so Random

Binning Outliers in a Histogram

Preparing Datetime Data for Analysis with padr and dplyr

Tree-based univariate testing

padr::pad does now do group padding

A wrapper around nested ifelse

Introducing padr

Building a column selecter

Designing our bathroom with R

Articles by That’s so Random

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)