In Praise of Substantive Expertise in Data Science

November 14, 2014
By

(This article was first published on Engaging Market Research, and kindly contributed to R-bloggers)

Substantive expertise makes it into the Data Science Venn Diagram from DataCamp’s infographic on how to become a data scientist. It’s one of the three circles of equal size along with programming and statistics. Regrettably, substantive expertise is never mentioned in the definition of a data scientist as “someone who is better at statistics than any software engineer and better at software engineering than any statistician.” And it gets no step. Statistics is the first step, and the remaining steps cover programming in all its varying forms. “Alas, poor Substance! I knew him, DataCamp.”

All of this, of course, is to be taken playfully. I have no quarrel with any of DataCamp’s 8-step program. I only ask that we recognize that there are three circles of equal value. Some of us come to data science with substantive expertise and seeking new models for old problems. Some even contribute libraries applying those models in their particular areas of substantive expertise. R provides a common language through which we can visit foreign disciplines and see the same statistical models from a different perspective.

John Chambers reminds us in his UseR! 2014 keynote address that R began as a “user-centric scientific software tool” providing “an interface to the very best numerical algorithms.” Adding an open platform for user-submitted packages, R also becomes the interface to a diverse range of applications. This is R’s unique selling proposition. It is where one goes for new ways of seeing.

To leave a comment for the author, please follow the link and comment on their blog: Engaging Market Research.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)