Site icon R-bloggers

Is the Tax Code the longest Title?

[This article was first published on Bommarito Consulting » r, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

  Last week, I shared that Dan Katz and I had finally published a draft of our paper, Measuring the Complexity of the Law: The U.S. Code.  We’d previewed this research on Computational Legal Studies years ago.  Since then, we’ve received great feedback and a number of questions.

  The most common question, even among legal professionals, is exactly what you’d guess – is the Tax Code (i.e., Title 26, I.R.C.) the longest Title?  The answer, in our opinion, is also what you might guess – it depends.  But first, let’s look at a few measures of Titles in the Code.

Element Count Distribution

Section Count Distribution

Tokens per Section Distribution

Token Count Distribution

(These plots are all based on data from our Github repository and the source to reproduce them can be found in this R and ggplot2 gist.)

  What do we notice?  Title 26 is not the longest or biggest of any measure.  It doesn’t have the most words (Title 42), the most elements/sections (again, Title 42), or even the most words per section (Title 23).  So what can we say?

U.S. Code network structure

  Hopefully, this discussion has piqued your interest in measuring legal complexity and raised your awareness around some common pitfalls.  If so, please give our paper a read and let us know what you think!

To leave a comment for the author, please follow the link and comment on their blog: Bommarito Consulting » r.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.