What kind of programmer are you? Stack Exchange can predict it, Shiny can graph it

February 1, 2015
By

(This article was first published on Variance Explained, and kindly contributed to R-bloggers)

I was impressed by Stack Exchange’s recent announcement of their machine learning system, Providence, that guesses what kind of programmer you are based on your Stack Overflow traffic. Stack Exchange uses this to choose what questions to show you on their homepage and to recommend jobs to you in their Careers ads.

One admirable feature is that Stack Exchange allows any user to download the algorithm’s personalized predictions about him/her as a JSON file. The JSON report classifies you according to several developer profiles (Web/Mobile/Other, BackEnd/FrontEnd/FullStack), guesses what technologies you are familiar with (DesktopOSX, FullStackWebRubyOnRails etc) and reports the information on tag views that was used to construct it. This is a great move to demystify the data mining process.

The only thing missing from this report is a way to visualize these predictions. And that’s a role that RStudio’s Shiny fills easily! With a mix of Shiny, rjson, dplyr, and ggplot2, I set up a page that lets anyone visualize their developer activity and predicted profile by uploading the JSON file. Check out the app here!

Let’s see, for instance, how Stack Exchange did at profiling me:

My developer profile

It got the StatsOrMathBackground right, and I can see how my Python and Django activity means I fit the profile of a web developer. I’m not sure why my MajorDeveloperKinds was 2/3 “Web” and 1/3 “Mobile” (when between those three options I certainly fit “Other”” the most), but it’s certainly food for thought.

Try it yourself! (And check out the R code here).

To leave a comment for the author, please follow the link and comment on their blog: Variance Explained.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...



If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Comments are closed.

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)