Blog Archives

Chat with the rOpenSci team at upcoming meetings

You can find members of the rOpenSci team at various meetings and workshops around the world. Come say ‘hi’, learn about how our software packages can enable your research, or about our process for open peer software review and onboarding, how you can get connected with the community or tell us how we can help you do open and...

Read more »

Building Reproducible Data Packages with DataPackageR

Building Reproducible Data Packages with DataPackageR

Sharing data sets for collaboration or publication has always been challenging, but it’s become increasingly problematic as complex and high dimensional data sets have become ubiquitous in the life sciences. Studies are large and time consuming; data collection takes time, data analysis is a moving target, as is the software used to carry it out. In the vaccine space (where...

Read more »

What have these birds been studied for? Querying science outputs with R

What have these birds been studied for? Querying science outputs with R

In the second post of the series where we obtained data from eBird we determined what birds were observed in the county of Constance, and we complemented this knowledge with some taxonomic and trait information in the fourth post of the series. Now, we could be curious about the occurrence of these birds in scientific work. In this post, we will query the scientific literature and...

Read more »

All the Badges One Can Earn: Parsing Badges of CRAN Packages READMEs

All the Badges One Can Earn: Parsing Badges of CRAN Packages READMEs

A while ago we onboarded an exciting package, codemetar by Carl Boettiger. codemetar is an R specific information collector and parser for the CodeMeta project. In particular, codemetar can digest metadata about an R package in order to fill the terms recognized by CodeMeta. This means extracting information from DESCRIPTION but also from e.g. continuous integration badges in the README! In this note, we’ll take advantage of codemetar::extract_badges function to...

Read more »

In praise of Commonmark: wrangle (R)Markdown files without regex

You might have read my blog post analyzing the social weather of rOpenSci onboarding, based on a text analysis of GitHub issues. I extracted text out of Markdown-formatted threads with regular expressions. I basically hammered away at the issues using tools I was familiar with until it worked! Now I know there’s a much better and cleaner way, that I’ll present in this note. Read on...

Read more »

What are these birds? Complement occurrence data with taxonomy and traits information

What are these birds? Complement occurrence data with taxonomy and traits information

Thanks to the second post of the series where we obtained data from eBird we know what birds were observed in the county of Constance. Now, not all species’ names mean a lot to me, and even if they did, there are a lot of them. In this post, we shall use rOpenSci’s packages accessing taxonomy and trait data in order to summarize some...

Read more »

What’s this bird? Classify old natural history drawings with R

What’s this bird? Classify old natural history drawings with R

In this new post, we’re taking a break from modern birding data in our birder’s series… let’s explore gorgeous drawings from a natural history collection! Armed with rOpenSci’s packages binding powerful C++ libraries and open taxonomy data, how much information can we automatically extract from images? Maybe not much, but we’ll at least have explored image manipulation, optical character recognition (OCR), language detection, taxonomic name resolution...

Read more »

rgbif: seven years of GBIF in R

rgbif: seven years of GBIF in R

rgbif was seven years old yesterday! What is rgbif? rgbif gives you access to data from the Global Biodiversity Information Facility (GBIF) via their API. A samping of use cases covered in rgbif: Search for datasets Get metrics on usage of datasets Get metadata about organizations providing data to GBIF Search taxonomic names Get quick taxonomic name suggestions Search occurrences by taxonomic name/country/collector/etc. Download occurrences by taxonomic name/country/collector/etc. Fetch raster...

Read more »

What birds are observed near Radolfzell? Bird occurrence data in R

What birds are observed near Radolfzell? Bird occurrence data in R

Thanks to the first post of the series we know where to observe birds near Radolfzell’s Max Planck Institute for Ornithology, so we could go and do that! Or we can stay behind our laptops and take advantage of eBird, a fantastic bird sightings aggregator! As explained by Matt Strimas-Mackey in his recent blog post, “The eBird database currently contains over 500 million records of bird sightings,...

Read more »

Mongolite 2.0: GridFS, connection pooling, and more

This week version 2.0 of the mongolite package has been released to CRAN. Major new features in this release include support for MongoDB 4.0, GridFS, running database commands, and connection pooling. Mongolite is primarily an easy-to-use client to get data in and out of MongoDB. However it supports increasingly many advanced features like aggregation, indexing, map-reduce, streaming, encryption, and enterprise...

Read more »

Search R-bloggers


Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)