The 10 Data Science Crack Commandments

[This article was first published on rstats – MikeJackTzen, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

 It’s the ten crack commandments, what? homie can’t tell me nothing about this code Can’t tell me nothing about these #rstats

Number 1, make a function from a script. Everyone knows we’re to busy to be copy/pasting shit

Number 2, never let ’em know your data manipulation moves. Don’t you know Bad Boys move in silence and violence?

Number 3: never trust point-o-five p’s, your moms’ll set that ass up, properly gassed up, hoodie to mask up, for that fast buck

Number 4: I know you heard this before “Never compute high on your own CPU supply”

Number 5: never store PII where you rest at

Number 6: that goddamn STATA*? Dead it You think a crackhead paying you back, shit forget it! (*STATA/SAS/SPSS)

Numero Siete: this rule is so underrated Keep your training and test set completely seperated Money and blood don’t mix like two…

Number 8, always keep survey weights on you. Them cats that squeeze your guns can ask what population your stats generalize to

Number 9 shoulda been Number 1 to me: If you ain’t gettin’ representative samples stay the fuck from police data

Number 10, a strong word called Bayes-i-an Strictly for live men, not for freshmen



To leave a comment for the author, please follow the link and comment on their blog: rstats – MikeJackTzen. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)