Vector Search vs. Binary Search
[This article was first published on Yet Another Blog in Statistical Computing » S+/R, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
# REFERENCE:
# user2014.stat.ucla.edu/files/tutorial_Matt.pdf
pkgs <- c('data.table', 'rbenchmark')
lapply(pkgs, require, character.only = T)
load('2008.Rdata')
dt <- data.table(data)
benchmark(replications = 10, order = "elapsed",
vector_search = {
test1 <- dt[ArrTime == 1500 & Origin == 'ABE', ]
},
binary_search = {
setkey(dt, ArrTime, Origin)
test2 <- dt[.(1500, 'ABE'), ]
}
)
# test replications elapsed relative user.self sys.self user.child
# 2 binary_search 10 0.335 1.000 0.311 0.023 0
# 1 vector_search 10 7.245 21.627 7.102 0.131 0
To leave a comment for the author, please follow the link and comment on their blog: Yet Another Blog in Statistical Computing » S+/R.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.