Blog Archives

Manipulating strings with the {stringr} package

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 4, in which I introduce the {stringr} package. Manipulate strings with {stringr}: {stringr} contains functions to manipulate strings. In Chapter 10, I will teach you about regular expressions, but the functions contained in {stringr} allow you to already...

Building a shiny app to explore historical newspapers: a step-by-step guide

Introduction: I started off this year by exploring a world that was unknown to me, the world of historical newspapers. I did not know that historical newspaper data was a thing, and have been thoroughly enjoying myself exploring the different datasets published by the National Library of Luxembourg. You can find the data here. In my first blog post, I analyzed data from L’indépendance Luxembourgeoise....

Using Data Science to read 10 years of Luxembourgish newspapers from the 19th century

I have been playing around with historical newspaper data (see here and here). I have extracted the data from the largest archive available, as described in the previous blog post, and have now created a shiny dashboard where it is possible to visualize the most common words per article, as well as read a summary of each article. The summary was made using a method called...

Making sense of the METS and ALTO XML standards

Last week I wrote a blog post where I analyzed one year of newspaper ads from 19th century newspapers. The data is made available by the National Library of Luxembourg. In this blog post, which is part 1 of a two-part series, I extract data from the 257 GB archive, which contains 10 years of publications of L’Union, another 19th century Luxembourgish...

Looking into 19th century ads from a Luxembourgish newspaper with R

The National Library of Luxembourg published some very interesting data sets: scans of historical newspapers! There are several data sets that you can download, from 250 MB up to 257 GB. I decided to take a look at the 32 GB “ML Starter Pack”. It contains high-quality scans of one year of L’indépendance Luxembourgeoise (Luxembourgish independence) from the year 1877. To make life easier...

R or Python? Why not both? Using Anaconda Python within R with {reticulate}

December 29, 2018

This short blog post illustrates how easy it is to use R and Python in the same R Notebook thanks to the {reticulate} package. For this to work, you might need to upgrade RStudio to the current preview version. Let’s start by importing {reticulate} with library(reticulate). {reticulate} is an RStudio package that provides “a comprehensive set of tools for interoperability between Python and R”. With...

Some fun with {gganimate}

December 26, 2018

In this short blog post I show you how you can use the {gganimate} package to create animations from {ggplot2} graphs with data from UNU-WIDER. WIID data: Just before Christmas, UNU-WIDER released a new edition of their World Income Inequality Database: *NEW #DATA* We’ve just released a new version of the World Income Inequality Database. WIID4 includes #data...

Object types and some useful R functions for beginners

December 23, 2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 2, which explains the different R objects you can manipulate as well as some functions to get you started. Objects, types and useful R functions to get started: All objects in R have a given type. You...

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

December 20, 2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 5, which presents the {tidyverse} packages and how to use them to compute descriptive statistics and manipulate data. In the text below, I show how you can use the {tidyverse} functions and principles for the estimation of...

Manipulate dates easily with {lubridate}

December 14, 2018

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 5, which presents the {tidyverse} packages and how to use them to compute descriptive statistics and manipulate data. In the text below, I scrape a table from Wikipedia, which shows when African countries gained independence from other...
