(This article was first published on

Bootstrapping is a very popular statistical technique. However, its Bayesian analogue proposed by Rubin (1981) is not very common. I was looking for an example of its implementation in GNU R and could not find one so I decided to write a snippet presenting it.**R snippets**, and kindly contributed to R-bloggers)In standard bootstrapping observations are sampled with replacement. This implies that observation weights follow multinomial distribution. In Bayesian bootstrap multinomial distribution is replaced by Dirichlet distribution.

This observation leads to very simple implementation of Bayesian bootstrap using gtools package. Here is the code presenting it with a simple application giving frequentist and Bayesian 95% confidence interval for mean in fbq and bbq variables:

library

**(**gtools**)**# Bayesian bootstrap

mean.bb

**<-**

**function**

**(**x, n

**)**

**{**

apply

**(**rdirichlet**(**n, rep**(**1, length**(**x**)))**, 1, weighted.mean, x**=**x**)****}**

# standard bootstrap

mean.fb

mean.fb

**<-****function****(**x, n**)****{** replicate

**(**n, mean**(**sample**(**x, length**(**x**)**,**TRUE****)))****}**

set.seed

**(**1**)**reps

**<-**100000x

**<-**cars**$**distsystem.time

system.time**(**fbq**<-**quantile**((**mean.fb**(**x, reps**))**, c**(**0.025, 0.075**)))****(**bbq

**<-**quantile

**((**mean.bb

**(**x, reps

**))**, c

**(**0.025, 0.075

**)))**

As it can be seen implementation of Bayesian bootstrap is fairly simple.

On my computer Bayesian bootstrap is approximately 80% slower than standard bootstrap, but its performance probably could be improved.

To

**leave a comment**for the author, please follow the link and comment on his blog:**R snippets**.R-bloggers.com offers

**daily e-mail updates**about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...