Blog Archives

Ideally, this shouldn’t be happening for such a deep network.

February 5, 2019
By
Ideally, this shouldn’t be happening for such a deep network.

Ideally, this shouldn’t be happening for such a deep network.

Read more »

Web Scraping Google Sheets with RSelenium

January 28, 2019
By
Web Scraping Google Sheets with RSelenium

Photo by freestocks.org on UnsplashI love to learn new things and one of ways I learn best is by doing. Also it’s been said that you never fully understand a topic until you are able to explain it , I think blogging is a low barrier to explaining things. Someone I met at a local data science meetup in Montréal wanted help...

Read more »

10 Tips for Choosing the Optimal Number of Clusters

January 27, 2019
By
10 Tips for Choosing the Optimal Number of Clusters

Photo by Pakata Goh on UnsplashClustering is one of the most common unsupervised machine learning problems. Similarity between observations is defined using some inter-observation distance measures or correlation-based distance measures. There are 5 classes of clustering methods: + Hierarchical Clustering+ Partitioning Methods (k-means, PAM, CLARA)+ Density-Based Clustering+ Model-based Clustering+ Fuzzy Clustering My desire to write this post came mainly from reading about the...

Read more »

Great post Yash!

December 10, 2018
By
Great post Yash!

Great post Yash! For those readers interested in getting data from the fitbit API using R I’ve documented the process here: https://towardsdatascience.com/the-gamification-of-fitbit-how-an-api-provided-the-next-level-of-training-eaf7b267af00

Read more »

The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

November 10, 2018
By
The Gamification Of Fitbit: How an API Provided the Next Level of tRaining

“Anyone can look for fashion in a boutique or history in a museum. The creative explorer looks for history in a hardware store and fashion in an airport” — Robert Wieder As much as I like Medium for it’s ability to reach an audience I prefer the customization afforded by github.io. If you want to see a far-fancier version of this post please...

Read more »

How to Highlight 3D Brain Regions

October 31, 2018
By
How to Highlight 3D Brain Regions

Recently, I was reading Howard et. al., (2018) “Genome-wide meta-analysis of depression in 807,553 individuals identifies 102 independent variants with replication in a further 1,507,153 individuals” and saw a really cool 3D visualization of highlighted brain regions associated with depression: Source: https://goo.gl/7rY5KVAfter an exhaustive search I couldn’t find any reference to how this was done in the methods or supplementary...

Read more »

The “Gold Standard” of Data Science Project Management

October 6, 2018
By
The “Gold Standard” of Data Science Project Management

The “Gold Standard” for Data Science Project ManagementThe inspiration for this post came most recently from a slide-deck by Ming Tang, a Bioinformatician at Harvard, and a new Chromebook Data Science course offered by Jeffery Leek from John Hopkin...

Read more »

Great post!

August 16, 2018
By
Great post!

Great post!I wanted to mention that although many previous studies have used the area under receiver operating characteristic curve (auROC) statistic to benchmark the precision, it misleads evaluators when the test data is (highly) imbalanced see: PLOS...

Read more »

Hi Pawel, I’m glad you enjoyed it.

July 25, 2018
By
Hi Pawel, I’m glad you enjoyed it.

Hi Pawel, I’m glad you enjoyed it. I was trying to play around with facet_grid() earlier but I guess I didn’t stumble upon the proper parameters. Your suggestion works perfectly; not only does it keep each grid x-axis width proportional to its length, but it also keeps appropriate space-between-variables. Thank you for sharing that!

Read more »

Recreating (more) data visualizations from the book “Knowledge is Beautiful”: Part IV

July 16, 2018
By
Recreating (more) data visualizations from the book “Knowledge is Beautiful”: Part IV

Welcome to the last part of the series where I recreate data visualizations in R from the book Knowledge is Beautiful by David McCandless.Links to part I, II, III of the series can be found here.Plane CrashesThis dataset will be used for a couple of visualizations.The first visualization is a stacked-barplot showing causes of crashes for every plane crash from...

Read more »

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)