Blog Archives

Unemployment in Europe

February 2, 2016
By
Unemployment in Europe

A couple of years I have made plots of unemployment and its change over the years. At first this was a bigger and complex piece of code. As things have progressed, the code can now become pretty concise. There are just plenty of packages to do the heav...

Read more »

A simple ANOVA

January 17, 2016
By
A simple ANOVA

I was browsing Davies Design and Analysis of Industrial Experiments (second edition, 1967). Published by for ICI in times when industry did that kind of thing. It is quite an applied book. On page 107 there is an example where the variance of a process is estimated.DataData is from nine batches from which three samples were selected (A, B and...

Read more »

A plot of ‘Who works at home’

January 3, 2016
By
A plot of ‘Who works at home’

I ran across this post containing displays on who works from home. I must say it looks great and is interactive but it did not help me understand the data. So I created this post to display the same data with a boring plot which might help me. For...

Read more »

Vacancies in the Netherlands

December 12, 2015
By
Vacancies in the Netherlands

Over the last couple of years, each weekend I have registering how many vacancies websites claim to have. This post shows some of the observations one may draw from the plots.DataData is from general and more specialized websites. The first observation...

Read more »

Wind in Netherlands II

November 29, 2015
By
Wind in Netherlands II

Two weeks ago I plotted how wind measurements on the edge of the North Sea changed in the past century. This week the same dataset is used for hypothesis testing.DataThe most important things to reiterate from previous post is that the data is from KNM...

Read more »

Wind in Netherlands

November 15, 2015
By
Wind in Netherlands

In climate change discussions, everybody talks about temperature. But weather is much more than that. There is at least rain and wind as directly experienced quality, and air pressure as measurable quantity. In the Netherlands, some observation station...

Read more »

Vacancies in Europe

November 1, 2015
By
Vacancies in Europe

I like playing around with data from Eurostat. At this time the tools to do so are just so easy. There are tools to pull the data directly from the data base in R (eurostat package). Process it a bit using dplyr and before you know it, ggplot makes a p...

Read more »

Trying to optimize

October 18, 2015
By

I wanted to try some more machine learning. On Kaggle there is a competition How Much Did It Rain? II. This is quite a bigger data set than Titanic. To quote from Kaggle:Rainfall is highly variable across space and time, making it notoriously tricky t...

Read more »

Predicting Titanic deaths on Kaggle VII: More Stan

October 4, 2015
By

Two weeks ago I used STAN to create predictions after just throwing in all independent variables. This week I aim to refine the STAN model. For this it is convenient to use the loo package (Efficient Leave-One-Out Cross-Validation and WAIC for Bayesian...

Read more »

Predicting Titanic deaths on Kaggle VI: Stan

September 19, 2015
By

It is a bit a contradiction. Kaggle provides competitions on data science, while Stan is clearly part of the (Bayesian) statistics. Yet after using random forests, boosting and bagging, I also think this problem has a suitable size for Stan, which I un...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)