Articles by Johannes B. Gruber on Johannes B. Gruber

Get all your packages back on R 4.0.0

April 26, 2020 | 0 Comments

R 4.0.0 was released on 2020-04-24. Among the many news items, two stand out for me. First, R now uses stringsAsFactors = FALSE by default, which is especially welcome when reading in data (e.g., via read.csv) and when constructing data.frames. The second item that caught my eye was that ... [Read more...]
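As a small illustration of the first point (a sketch, not code from the post itself): the snippet below builds a data.frame with a text column and checks its class. The object names `df` and `df_explicit` are made up for this example.

```r
# Build a small data.frame with a text column.
df <- data.frame(name = c("a", "b"), value = 1:2)

# On R >= 4.0.0 this column stays character;
# on earlier versions it was converted to a factor by default.
class(df$name)

# Passing the argument explicitly gives character columns on any R version.
df_explicit <- data.frame(
  name = c("a", "b"),
  value = 1:2,
  stringsAsFactors = FALSE
)
class(df_explicit$name)

# The same argument applies when reading data, e.g. with read.csv().
csv_dat <- read.csv(text = "name,value\na,1\nb,2", stringsAsFactors = FALSE)
class(csv_dat$name)
```

Setting `stringsAsFactors = FALSE` explicitly is only needed if the code also has to run on R versions older than 4.0.0.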

(Much) faster unnesting with data.table

October 20, 2019 | 0 Comments

Today I was struggling with a relatively simple operation: unnest() from the tidyr package. What it’s supposed to do is pretty simple. When you have a data.frame where one or multiple columns are lists, you can unlist these columns while duplicating the information in other columns if the ... [Read more...]
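To make the operation concrete, here is a minimal sketch of unnesting a list column with a common data.table idiom. It is not necessarily the approach taken in the full post; the column names `id` and `vals` are invented for the example, and it assumes the data.table package is installed.

```r
library(data.table)

# A table where `vals` is a list column: one vector per row.
dt <- data.table(
  id   = c(1, 2),
  vals = list(c("a", "b"), c("c", "d", "e"))
)

# Unlist the list column within each group; the value of the
# grouping column `id` is repeated for every unlisted element.
long <- dt[, .(vals = unlist(vals)), by = id]

long
```

Every column that should be duplicated has to appear in `by`. With several such columns, one option is `by = setdiff(names(dt), "vals")`.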

Introducing rwhatsapp

September 22, 2019 | 0 Comments

I’m happy to announce that rwhatsapp is now on CRAN. After being tested by users on GitHub for a year now, I decided it is time to make the package available to a wider audience. The goal of the package is to make working with ‘WhatsApp’ chat logs as ...
[Read more...]

You R my Valentine!

February 13, 2019 | 0 Comments

Today is Valentine’s Day. And since both I and my sweetheart are R enthusiasts, here is how to say “I love you” using a statistical programming language: library("dplyr") library("gganimate") library("ggplot2") hrt_dat % bind_rows(data.frame(t = rep(max(.$t), 300))) %>% mutate(xhrt = 16 * sin(t) ^ 3, yhrt = 13 * cos(...
[Read more...]

Smarter Wordclouds

February 1, 2019 | 0 Comments

Some time ago, I saw a presentation by Wouter van Atteveldt who showed that wordclouds aren’t necessarily stupid. I was amazed since wordclouds were one of the first things I ever did in R and they are still often shown in introductions to text analysis. But the way they ...
[Read more...]

(Mis)using Discourse Network Analyzer for manual coding

September 11, 2018 | 0 Comments

For my PhD project, I want to use Supervised Machine Learning (SML) to replicate my manual coding efforts onto a larger data set. That means, however, that I need to put in some manual coding effort before the SML algorithms can do their magic! I used ...
[Read more...]

LexisNexisTools. My first `R` package

May 18, 2018 | 0 Comments

My PhD supervisor once told me that everyone doing newspaper analysis starts by writing code to read in files from the ‘LexisNexis’ newspaper archive. However, while I do recommend this exercise, not everyone has the time. These are the first words of the introduction to my first R package, LexisNexisTools. ...
[Read more...]

Highly Similar Texts

May 16, 2018 | 0 Comments

A while ago I was building a database of newspaper articles retrieved from LexisNexis for a research project in which I was working as a research assistant. At some point we noticed that we ... [Read more...]
