Articles by R on Stats and R

The 9 concepts and formulas in probability that every data scientist should know

March 2, 2020 | R on Stats and R

What is probability? 1. A probability is always between 0 and 1 2. Compute a probability 3. Complement of an event 4. Union of two events 5. Intersection of two events 6. Independence of two events 7. Conditional probability Bayes’ theorem Example 8. Accuracy measures False negatives False positives Sensitivity Specificity Positive predictive value Negative predictive value 9. Counting techniques Multiplication ...
[Read more...]

The 9 concepts and formulas in probability that every data scientist should know

March 2, 2020 | R on Stats and R

What is probability? 1. A probability is always between 0 and 1 2. Compute a probability 3. Complement of an event 4. Union of two events 5. Intersection of two events 6. Independence of two events 7. Conditional probability Bayes’ theorem Example 8. Accuracy measures False negatives False positives Sensitivity Specificity Positive predictive value Negative predictive value 9. Counting techniques Multiplication ...
[Read more...]

Student’s t-test in R and by hand: how to compare two groups under different scenarios

February 27, 2020 | R on Stats and R

Introduction Null and alternative hypothesis Hypothesis testing Different versions of the Student’s t-test How to compute Student’s t-test by hand? Scenario 1: Independent samples with 2 known variances Scenario 2: Independent samples with 2 equal but unknown variances Scenario 3: Independent samples with 2 unequal and unknown variances Scenario 4: Paired samples where the variance ...
[Read more...]

Student’s t-test in R and by hand: how to compare two groups under different scenarios

February 27, 2020 | R on Stats and R

Introduction Null and alternative hypothesis Hypothesis testing Different versions of the Student’s t-test How to compute Student’s t-test by hand? Scenario 1: Independent samples with 2 known variances Scenario 2: Independent samples with 2 equal but unknown variances Scenario 3: Independent samples with 2 unequal and unknown variances Scenario 4: Paired samples where the variance ...
[Read more...]

Getting started in R markdown

February 17, 2020 | R on Stats and R

R Markdown: what, why and how? Before you start Components of a .Rmd file YAML header Code chunks Text Code inside text Highlight text like it is code Images Tables Additional notes and useful resources If you have spent some time writing cod...
[Read more...]

Getting started in R markdown

February 17, 2020 | R on Stats and R

R Markdown: what, why and how? Before you start Components of a .Rmd file YAML header Code chunks Text Code inside text Images Tables Additional notes and useful resources Photo by Jon Tyson If you have spent some time writing code in R, you probably have heard of generating dynamic ...
[Read more...]

The complete guide to clustering analysis: k-means and hierarchical clustering by hand and in R

February 12, 2020 | R on Stats and R

What is clustering analysis? Application 1: Computing distances Solution k-means clustering Application 2: k-means clustering Data kmeans() with 2 groups Quality of a k-means partition nstart for several initial centers and better stability kmeans() with 3 groups Optimal number of clusters Elbow method Silhouette method Gap statistic method NbClust() Visualizations Manual application and verification ...
[Read more...]

The complete guide to clustering analysis: k-means and hierarchical clustering by hand and in R

February 12, 2020 | R on Stats and R

What is clustering analysis? Application 1: Computing distances Solution k-means clustering Application 2: k-means clustering Data kmeans() with 2 groups Quality of a k-means partition nstart for several initial centers kmeans() with 3 groups Manual application and verification in R Solution by hand Solution in R Hierarchical clustering Application 3: hierarchical clustering Data Solution by ...
[Read more...]

Chi-square test of independence in R

January 26, 2020 | R on Stats and R

Introduction Data Chi-square test of independence in R Conclusion and interpretation Combination of plot and statistical test Introduction This article explains how to perform the Chi-square test of independence in R and how to interpret its r...
[Read more...]

Chi-square test of independence in R

January 26, 2020 | R on Stats and R

Introduction Example Data Chi-square test of independence Conclusion and interpretation Introduction This article explains how to perform the Chi-square test of independence in R and how to interpret its results. To learn more about how the test works and how to do it by hand, I invite you to read ...
[Read more...]

How to create a timeline of your CV in R

January 25, 2020 | R on Stats and R

Introduction Minimal reproducible example How to personalize it Introduction In this article, I show how to create a timeline of your CV in R. A CV timeline illustrates key information about your education, work experiences and extra activities. The main advantage of CV timelines compared to regular CV is that ...
[Read more...]
1 2 3 4

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)