covid19italy v0.2.0 is now on CRAN

[This article was first published on Rami Krispin, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Last week I pushed an update of the covid19italy package to CRAN (v0.2.0). The covid19italy R package provides a tidy format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) pandemic outbreak in Italy. The package includes the following three datasets:

  • italy_total – daily summary of the outbreak on the national level
  • italy_region – daily summary of the outbreak on the region level
  • italy_province – daily summary of the outbreak on the province level

More details about the datasets available on the following vignette

Data source: Italy Department of Civil Protection

Main updates in version 0.2.0:

  • Data structure changes – updating changes in the raw data, such as adding new variables that related to the number of tests performed on both region and national level. More details available on the package changelog
  • Geospatial columns – added the region_spatial and province_spatial columns for the italy_region and italy_province datasets, respectively. Those columns have the corresponding naming convention of Italy regions and province as in the output of the ne_states function from the rnaturalearth package. See example below
  • Cron job – the data is now automatically refreshed on a daily basis with the use of Github Actions to run cron job

Keep the data updated

While the covid19italy CRAN version is updated every month or two, the Github (Dev) version is updated on a daily bases. The update_data function enables to overcome this gap and keep the installed version with the most recent data available on the Github version:



Note: must restart the R session to have the updates available

Plotting cases with a choropleth map

As mentioned above, one of the new features in v0.2.0 is the geospatial columns on the italy_region and italy_province. Those columns are using the naming conventions as used by the rnaturalearth package with the ne_states function. Therefore, it allows a simple and quick merge of the data with the geometric data of the regions and provinces of Italy. The following example demonstrates the use-case of this feature along with the rnaturalearth and mapview packages to create a choropleth map of the total confirmed cases in Italy by province. We will start with pulling the geometric data of Italy with the ne_states function and merge it with the italy_province dataset:


italy_map <- ne_states(country = "Italy", returnclass = "sf")  %>% 
  select(province = name, region, geometry)  %>%
  left_join(italy_province %>% 
              filter(date == max(date)), # subseting for the most recent day
            by = c("province" = "province_spatial"))

Next, we will use the mapview package to plot the covid19 confirmed cases by Italy provinces:

italy_map %>%
  mapview(zcol = "total_cases")

To leave a comment for the author, please follow the link and comment on their blog: Rami Krispin. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)