The 10 Data Science Crack Commandments

March 31, 2018
By

(This article was first published on rstats – MikeJackTzen, and kindly contributed to R-bloggers)

 It’s the ten crack commandments, what? homie can’t tell me nothing about this code Can’t tell me nothing about these #rstats

Number 1, make a function from a script. Everyone knows we’re to busy to be copy/pasting shit

http://adv-r.had.co.nz/Functions.html

Number 2, never let ’em know your data manipulation moves. Don’t you know Bad Boys move in silence and violence?

http://andrewgelman.com/2018/03/13/fear-many-people-drawing-wrong-lessons-wansink-saga-focusing-procedural-issues-p-hacking-rather-scientifically-important-concerns-2/

Number 3: never trust point-o-five p’s, your moms’ll set that ass up, properly gassed up, hoodie to mask up, for that fast buck

https://www.nature.com/articles/s41562-017-0189-z

Number 4: I know you heard this before “Never compute high on your own CPU supply”

https://arxiv.org/abs/1410.0846

Number 5: never store PII where you rest at

https://www2.census.gov/foia/ds_policies/ds007.pdf

Number 6: that goddamn STATA*? Dead it You think a crackhead paying you back, shit forget it! (*STATA/SAS/SPSS)

https://thomaswdinsmore.com/2018/03/07/sas-is-on-the-brink-of-something/#comment-10243

Numero Siete: this rule is so underrated Keep your training and test set completely seperated Money and blood don’t mix like two…

https://statistics.stanford.edu/research/estimating-error-rate-prediction-rule-improvements-cross-validation

Number 8, always keep survey weights on you. Them cats that squeeze your guns can ask what population your stats generalize to

https://www.statschat.org.nz/2016/10/25/oversampling/

Number 9 shoulda been Number 1 to me: If you ain’t gettin’ representative samples stay the fuck from police data

https://www.vox.com/2016/7/11/12148452/police-shootings-racism-study

Number 10, a strong word called Bayes-i-an Strictly for live men, not for freshmen

https://projecteuclid.org/euclid.aos/1176346785

 

#RIPBIGGIE

To leave a comment for the author, please follow the link and comment on their blog: rstats – MikeJackTzen.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)