Articles by r on Joel Soroos

Kmeans Clustering of Penguins

September 25, 2020 | r on Joel Soroos

In today's blog, I explore k-means clustering capabilities in R including algorithms, visualizations and methodologies to identify the optimal number of clusters ("k"). K-means clustering is an unsupervised machine learning tool to group s...
[Read more...]

Kmeans Clustering of Penguins

September 19, 2020 | r on Joel Soroos

In today's blog, I explore k-means clustering capabilities in R including algorithms, visualizations and methodologies to identify the optimal number of clusters ("k"). K-means clustering is an unsupervised machine learning tool to group s...
[Read more...]

UpSet Charts Comparison

July 6, 2020 | r on Joel Soroos

Motivation For the May 26 2020 R4DS Tidy Tuesday data set, I explored UpSet charts, which simplify visualizing overlap of large numbers of sets. Laura Ellis has a useful explanation of how Venn and Euler diagrams become unwieldy for intersections of greater than two sets. I analyzed which ingredients, as well ...
[Read more...]

Spotify Song Analytics

May 1, 2020 | r on Joel Soroos

Motivation Spotify is an amazing app to play favorite music, discover new music and rediscover old favorites. In addition, the Spotify API provides free access to a wide array of data on songs, which R users can leverage via Charlie Thompson's spot...
[Read more...]

Favorite links

April 21, 2020 | r on Joel Soroos

Learning R “R for Data Science” — seminal Tidyverse book by Hadley Wickham & Garrett Grolemund “Learning R Step by Step” - hard copy book by Richard Cotton Statistics with R — online Duke courses via Coursera R Programmi... [Read more...]

Tidy Tuesday Functions

February 8, 2020 | r on Joel Soroos

    1. Most widely learned functions are in ggplot2 and dplyr packages. package n percent ggplot2 16 0.176 dplyr 9 0.099 base 6 0.066 stringr 5 0.055 ggimage 3 0.033 ggraph 3 0.033 lubr... [Read more...]

Visualizations

November 14, 2019 | r on Joel Soroos

A collection of data visualizations I have built for fun using the R programming language. All visualizations are produced via code without post-processing. The examples source data from the weekly R4DS #TidyTuesday project, which is a community fo...
[Read more...]

Favorite R Packages

November 14, 2019 | r on Joel Soroos

Sourcing readr - fast and friendly way to read rectangular data (i.e. csv, tsv) RODBC - SQL querying against relational databases readxl - importing Excel files including individual worksheets rvest - tools for working with http connections rio - ... [Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)