openwashdata 101

[This article was first published on openwashdata, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Welcome! We are so happy that you join the openwashdata community!

While our common goal is more accessible data in the WASH sector, we do have different backgrounds and skill-sets. So feel free to skip some parts that are familiar to you. On the other hand, if you find some terms confusing, like “R programming” or “GitHub”, stay tuned by subcribing to our newsletter or get in touch. We will provide more detailed posts about open data practices and opportunities for training in the future.

1. Access and use data

You could get involved with the openwashdata community by using our published data:

  • If you are a researcher, you may analyze and cite our data in your scientific work.
  • If you are a lecturer, consider to integrate our data into your training and course materials.
  • If you do business in the WASH sector, you may refer to some of the updated data for your business cases.

The development team is constantly working on organizing, cleaning, and releasing new datasets. The following sections introduce two ways to use the published data.

1.1 Let’s do R, why not 🥳

If you have worked with R programming1, we are actively developing new R data packages. You could access the data by installing the package in R and start from there. For example, we have released the dataset package wasteskipsblantyre that contains locations of the waste skips in Blantyre, Malawi from the year 2021.

```{r}
# install devtools package that facilitates installation from GitHub
install.packages("devtools")

# Install data package from GitHub
devtools::install_github("openwashdata/wasteskipsblantyre")

# Import data package
library(wasteskipsblantyre)

# Check documentation
?wasteskipsblantyre
```

Now you have successfully loaded the data and can start to explore this dataset with your creativity!

1.2 Nah, just give me the files 😈

For the published datasets, we also provide the csv files (.csv) and the spreadsheets (.xlsx) that you can directly download from the associated website. For example, with the wasteskipsblantyre dataset, you may find the files as shown in the following figure2.

screenshot of downloading spreadsheet data

We are opinionated to discourage this direction because the analysis work with the data might be less reproducible. Nevertheless, for the moment, we still think it’s a good way to make data as accessible as possible.

1.3 Cite openwashdata

All datasets by default are released under CC-BY license. When using openwashdata, consider to cite the data package to acknowledge data collectors and maintainers. Each dataset has its citation information on the documentation website. For example, you may find how to cite package wasteskipsblantyre here.

2. Share your data with openwashdata

You got some amazing data and love to share? Great! Become a contributor by getting started with some tools and submit your data ideas.

You can also always get in touch with us via Email.

2.1 Join platforms for open data practices

We probably all have some social media to share life updates. For an open data community, likewise, we need tools to help us share, track, and communicate progress. Hence, we have picked widely used tools as part of the starter-kit to build openwashdata.

  • ORCID iD: a personal identifier of scholarly contribution, like a business card.
  • GitHub: our digital headquarter of openwashdata datasets.
  • Matrix: an instant message chatroom for the openwashdata community, like Slack or Messenger.

ORCID iD

  1. Go to Register an ORCID iD

    1. If your institute has an affiliation, you could choose “Access through your institution” to register and sign in.

    2. Otherwise, you may opt in th “Google” or “Facebook” sign-in options.

    3. If the above-mentioned options are not for you, click “Register Now” and follow the instructions.

  2. Save your ORCID iD

GitHub

  1. Go to Sign up for Github and follow the instructions to get an account.
  2. Open openwashdata GitHub organization and click “Follow” on the upper right corner to connect with us on GitHub.

Matrix

  1. If you do not have a Matrix account yet, follow these instructions to set up!
  2. Join our chatroom with a direct link to: openwashdata-lobby
  3. Say “hi” in openwashdata-lobby!

2.2 Submit a data idea

Now that you have a GitHub account, you can submit your data idea and collaborate with us by:

  • Log into GitHub
  • Go to our data idea tracker
  • Follow the instructions to submit your idea

We hope this post not only helps you with a smooth start to collaborate with us in the openwashdata community, but also benefit you with long-term open science practice. Join the Matrix chatroom to get support and advice from now on.

Footnotes

  1. “A free software environment for statistical computing and graphics.”↩︎

  2. The figure is a screenshot from wasteskipsblantyre package website and you may also want to check out this awesome example data analysis article.↩︎

To leave a comment for the author, please follow the link and comment on their blog: openwashdata.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)