Benford’s law, or the First-digit law

[This article was first published on Statistic on aiR, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Benford’s law, also called the first-digit law, states that in lists of numbers from many (but not all) real-life sources of data, the leading digit is distributed in a specific, non-uniform way. According to this law, the first digit is 1 about 30% of the time, and larger digits occur as the leading digit with lower and lower frequency, to the point where 9 as a first digit occurs less than 5% of the time.
Wikipedia, retrieved 08/25/2011





R simulation:
library(MASS) benford <- function(m, n){ list <- c() # compute all m^n, for n= 1, 2, ..., i, ..., n for(i in 1:n){ list[i] <- m^i } # a function to extract the first digit from a number bben <- function(k){ as.numeric(head(strsplit(as.character(k),'')[[1]],n=1)) } # extract the first digit from all numbers computed first.digit <- sapply(list, bben) # plot frequency of first digits truehist(first.digit, nbins=10, main=m) } par(mfrow=c(2,2)) benford(2,1000) benford(3,640) # if n is greater, it returns "inf" (on my pc) benford(4,500) benford(5,440)


To leave a comment for the author, please follow the link and comment on their blog: Statistic on aiR.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)