Here you will find daily news and tutorials about R, contributed by over 573 bloggers.
There are many ways to follow us - By e-mail:On Facebook: If you are an R blogger yourself you are invited to add your own R content feed to this site (Non-English R bloggers should add themselves- here)

His analysis of the data led to the question: where did the source data come from in the first place? With some crowdsourced sleuthing, Christopher discovered the data comes from the first edition of the book Whisky Classified: Choosing Single Malts by Flavour by David Wishart. The story behind the data is quite interesting, and worth checking out if you're a whisky fan.

It turns out the data file Luba used comes from the first edition of the "Whisky Classified" book, and there were a few typos in the data to boot (for example, Bowmore had a Medicinal ranking of 1, but was actually a 2 in the book.) A commenter "Florin" at the Scotch and Ice Cream blog cleaned up the data and re-ran the analysis, and generated four slightly different clusters: peaty whiskies, ex-sherry whiskies, ex-bourbon / no peat whiskies, and whiskies with some ex-sherry blended in or with some peat. Extending the analysis to five clusters apparently succeeded in "separating the hard-core peated whiskies from the less-peated ones".

Just goes to show: with just 86 rows of data here, you don't always need "Big Data" to generate interesting analysis!

Related

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.