Site icon R-bloggers

Excess Deaths during the 1st Wave of Covid-19

[This article was first published on R – Predictive Hacks, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Abstract

Our goal is to provide some summary statistics of deaths across countries during the 1st Wave of Covid-19 and to compare these numbers with the corresponding ones of the previous years. This analysis is not scientific and we cannot drive any conclusion about the impact of Covid-19 since we need to take into consideration many other parameters and this is out of the scope of this analysis.

We still keep our character which is a Data Science focusing on R and Python and for that reason, we will work with R and we will share the code making the analysis to be reproducible.

Data Source

We found many difficulties to find structured, reliable and consistent data about deaths across countries. For this analysis we obtained the data from the Our World in Data and more particularly at the Excess Mortality using raw deaths counts and then we downloaded the excess-mortality-raw-death-counts.csv file.

Some notes about the data:


Summary Statistics by Country

The data is from 2015 up to 2020 and is referring to the same period (for example January to September). We will provide the following summary statistics by country:

Prepare the data:

library(tidyverse)

# https://ourworldindata.org/excess-mortality-covid?fbclid=IwAR1LZGFQGl_nXmVMMnhnXOkLzhYTHglGPGYBIKyxOQCP3L2HLr9H99XNgE0
df<-read.csv("excess-mortality-raw-death-count.csv")

# Rename the column names
names(df)<-c("Country", "Code", "Date", "Deaths_2020", "Deaths_AVG_2015_2019", 
             "Deaths_2015", "Deaths_2016", "Deaths_2017", "Deaths_2018", "Deaths_2019")


# Convert the Date to Date-Format
df$Date<-as.Date(df$Date)

US

Let’s provide the summary statistics of the US.

# Excess Deaths by Month in US 
# Create also a Date Month column where we truncate the date to the month
# Create also a month column which is the month
country<- df%>%filter(Code=="USA")%>%
  mutate(Date_Month=lubridate::floor_date(Date, unit = "month"), Month = lubridate::month(Date, label=TRUE))


country%>%select(-Deaths_AVG_2015_2019)%>%group_by(Month)%>%
  summarise(across(Deaths_2020:Deaths_2019, ~mean(.x, na.rm=TRUE)))
 

Table of Average Weekly Deaths by Month

Chart of Average Weekly Deaths by Month

country%>%select(-Deaths_AVG_2015_2019)%>%group_by(Month)%>%
  summarise(across(Deaths_2020:Deaths_2019, ~mean(.x, na.rm=TRUE)))%>%
  pivot_longer(-Month, names_to="Year", values_to="Total_Deaths")%>%
  ggplot(aes(x=Month, y=Total_Deaths, group=Year))+
  geom_line(aes(col=Year))+geom_point(aes(col=Year))+
  ggtitle("US: Avg Weekly Deaths by Month")+ylab("Avg Weekly Deaths within Month")+
  theme_minimal()
 

As we can see, in 2020 the deaths during January and February were very closed with the past years and then, from February onwards there is a significant increase.

Total Deaths per Year for the same period

country%>%select(-Deaths_AVG_2015_2019)%>%
  summarise(across(Deaths_2020:Deaths_2019, ~sum(.x, na.rm=TRUE)))
 

Percentage difference in Total deaths of 2020 compared to previous years

# % Increase in deaths 
country%>%select(-Deaths_AVG_2015_2019)%>%
  summarise(sum(Deaths_2020)/across(Deaths_2020:Deaths_2019, ~sum(.x, na.rm=TRUE))-1)
 

We can see that compared to 2019, there is an increase of 13.53% in deaths.


Summary Statistics for Other Countries

We provided a detailed example of how you can generate these statistics for a specific country. You can simply change the country code. Let’s provide the charts and the percentage of increased deaths compared to previous years for the other countries.

Italy


Spain


France


England and Wales


Sweden


Discussion

From the data above we may argue that for the countries that have many Covid-19 cases, there is a significant increase in raw deaths during the March-May period and then during the summer period the raw deaths for most the countries revert back to its “normal” levels. All of these countries above showed an increase in deaths compared to the previous years. For example, Spain has 17.76% more deaths compared to 2019 for the same period (Jan-Sep). The corresponding numbers for the rest countries are US 13.53%, Italy 12.35%, France 4.64%, England and Wales 15.62% and Sweden 9.81%.

Again we want to stress out that we cannot guarantee the validity of our data and we do not take into consideration other parameters. For example, The pandemic may result in fewer deaths from other causes – for instance, the mobility restrictions during the pandemic might lead to fewer deaths from road accidents. From the data above it seems that there is an “anomaly” in deaths during the 1st Wave of Covid-19. If this Covid-19 is a hoax according to our friends below, then what is this external factor which causes more deaths?

To leave a comment for the author, please follow the link and comment on their blog: R – Predictive Hacks.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.