Articles by

Beautiful Visualizations in R

June 12, 2018 |

I recently discovered the R Graph Gallery, where users can share the beautiful visualizations they've created using R and its various libraries (especially ggplot2). One of my favorite parts about this gallery is a section called Art From Data, in whic...
[Read more...]

Statistics Sunday: Creating Wordclouds

June 10, 2018 |

Cloudy with a Chance of Words Lots of fun projects in the works, so today's post will be short - a demonstration on how to create wordclouds, both with and without sentiment analysis results. While I could use song lyrics again, I decided to use a different dataset that comes ...
[Read more...]

Statistics Sunday: Two R Packages to Check Out

May 27, 2018 |

I'm currently out of town and not spending as much time on my computer as I have over the last couple months. (It's what happens when you're the only one in your department at work and also most of your hobbies involve a computer.) But I wanted to write up ...
[Read more...]

How Has Taylor Swift’s Word Choice Changed Over Time?

May 22, 2018 |

How Has Taylor Swift's Word Choice Changed Over Time?Sunday night was a big night for Taylor Swift - not only was she nominated for multiple Billboard Music Awards; she took home Top Female Artist and Top Selling Album. So I thought it was a good time for some more ...
[Read more...]

What Makes a Song (More) Popular

May 18, 2018 |

Earlier this week, the Association for Psychological Science sent out a press release about a study examining what makes a song popular:Researchers Jonah Berger of the University of Pennsylvania and Grant Packard of Wilfrid Laurier University were interested in understanding the relationship between similarity and success. In a recent ...
[Read more...]

Statistics Sunday: Tokenizing Text

May 6, 2018 |

Statistics Sunday: Tokenizing TextI recently started working my way through Text Mining with R: A Tidy Approach by Julia Silge and David Robinson. There are many interesting ways text analysis can be used, not only for research interests but for marketing and business insights. Today, I'd like to introduce one ...
[Read more...]

Statistical Sins: Is Your Classification Model Any Good?

May 2, 2018 |

Prediction with Binomial RegressionApril A to Z is complete! We now return to your regularly scheduled statistics blog posts! Today, I want to talk about an issue I touched on during A to Z: using regression to predict values and see how well your model is doing.Specifically, I talked ...
[Read more...]

Z is for Z-Scores and Standardizing

April 30, 2018 |

Z is for Z-Scores and StandardizingLast April, I wrapped up the A to Z of Statistics with a post about Z-scores. It seems only fitting that I'm wrapping this April A to Z with the same topic. Z-scores are frequently used, sometimes when you don't even realize it. When you ...
[Read more...]

Statistics Sunday: Conducting Meta-Analysis in R

April 29, 2018 |

Here it is, everyone! The promised 4th post on meta-analysis, and my second video for Deeply Trivial! In this video, I walk through conducting a basic meta-analysis, both fixed and random effects, in the metafor package:See these previous posts and lin... [Read more...]

Y is for Ys, Y-hats, and Residuals

April 28, 2018 |

Y is for Ys, Y-hats, and Residuals When working with a prediction model, like a linear regression, there are a few Ys you need to concern yourself with: the ys (observed outcome variable), the y-hats (predicted outcome variables based on the equation), and the residuals (y minus y-hat). Today, I'll ...
[Read more...]

X is for By

April 27, 2018 |

X is for ByToday's post will be rather short, demonstrating a set of functions from the psych package, which allows you to conduct analysis by group. These commands add "By" to the end of existing functions. But first, a word of caution: With great power comes great responsibility. This function ...
[Read more...]

W is for (Meta-Analysis) Weights

April 26, 2018 |

Weights in Meta-AnalysisYesterday, I talked about the variance calculation for meta-analysis, which is based on sample size - the exact calculation varies by the type of effect size you're using. It's a good idea to inspect those variance calculations. There are many places where your numbers for the meta-analysis can ...
[Read more...]

V is for (Meta-Analysis) Variance

April 25, 2018 |

Variance in Meta-Analysis For the letter E, I introduced the metafor package to compute effect sizes. That is, you provide a data frame with the study information and the data needed to compute the effect size(s), and metafor does that for you. But what I didn't mention is that ...
[Read more...]

U is for (Data From) URLs

April 24, 2018 |

Working with URLs in RUp to now, we've been working with files saved locally on your computer. But that limits you to files that can be easily saved to your computer and, up to now, structured data. As we move from pure statistics to a data science approach, more and ... [Read more...]

T is for tibble

April 23, 2018 |

T is for TibbleFor the letter D, I introduced data frames, a built-in R object type. But as I've learned more about R and, in particular, the tidyverse - most recently when I finally started reading Text Mining with R: A Tidy Approach - I learned about a more modern ...
[Read more...]

Statistics Sunday: Using semPlot

April 22, 2018 |

using semPlot with Facebook ModelsToday's post will be mostly demonstration, but I'll build on some of the things I covered in yesterday's semPlot post. This month, I've blogged about two SEM models: confirmatory factor analysis and latent variable path analysis. Using the models from those posts, I'll show how to ...
[Read more...]
1 2 3

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)