Blog Archives

The Antarctic/Southern Ocean rOpenSci community

Antarctic/Southern Ocean science and rOpenSci: Collaboration and reproducibility are fundamental to Antarctic and Southern Ocean science, and the value of data to Antarctic science has long been promoted. The Antarctic Treaty (which came into force in 1...

Tesseract 4 is here! State of the art OCR in R!

Last week Google and friends released the new major version of their OCR system: Tesseract 4. This release builds upon 2+ years of hard work and has completely overhauled the internal OCR engine. From the tesseract wiki: Tesseract 4.0 includes a new neural network-based recognition engine that delivers significantly higher accuracy (on document images) than the previous versions, in return...

Sharing the Recipe for rOpenSci’s Unconf Ice Breaker

While many people groan at the thought of participating in a group ice breaker activity, we’ve gotten consistent feedback from people who have been to recent rOpenSci unconferences: “Best ice breaker ever!” We’ve had lots of requests for a detailed description of how we do it. This post shares our recipe, including a script you can adapt, a reflection on its...

Community Call – Working with images in R

rOpenSci’s software engineer / postdoc Jeroen Ooms will explain what images are, under the hood, and showcase several rOpenSci packages that form a modern toolkit for working with images in R, including opencv, av, tesseract, magick and pdftools. 🕘 Thursday, November 15, 2018, 10-11AM PST; 7-8PM CET (find your timezone) ☎️ Find all details for joining the call on our Community...

pubchunks: extract parts of scholarly XML articles

pubchunks is a package that grew out of the fulltext package. fulltext provides a single interface to many sources of full-text scholarly articles. As part of the user flow in fulltext there is an extraction step where fulltext::chunks() pulls parts of articles out of XML-format article files. As part of making fulltext more maintainable and focused on simply fetching articles, and realizing that...

Parsing Metadata with R – A Package Story

Every R package has its story. Some packages are written by experts, some by novices. Some are developed quickly, others are long in the making. This is the story of jstor, a package which I developed during my time as a student of sociology, working on a research project on the scientific elite within sociology. Writing the package has taught me many things...

Distinguish yourself in CRAN person() with ORCID

Proper identification of individuals is crucial for acknowledging and studying their scientific work, be it journal articles or pieces of software. In this tech note, one year after CRAN started supporting ORCIDs, we shall explain why and how to use unique author identifiers in DESCRIPTION files. Why use ORCIDs on CRAN? When analyzing the authorship of CRAN packages, one can look at authors’ names and email...

The av Package: Production Quality Video in R

At rOpenSci we are developing a suite of packages that expose powerful graphics and imaging libraries in R. Our latest addition is av – a new package for working with audio/video based on the FFmpeg AV libraries. This ambitious new project will become the video counterpart of the magick package which we use for working with images.

install.packages("av")
av::av_demo()

The package...

Community Call – Code Review in the Lab, or … How do you review code that accompanies a research project?

Do you have code that accompanies a research project or manuscript? How do you review and archive that code before you submit a paper? Our next Community Call will present different perspectives on this hot topic, with plenty of time for Q&A. What’s the culture of the group around feedback and code collaboration? What are the use cases? What are some practices...

outcomerate: Transparent Communication of Quality in Social Surveys

Background: Surveys are ubiquitous in the social sciences, and the best of them are meticulously planned out. Statisticians often decide on a sample size based on a theoretical design, and then proceed to inflate this number to account for “sample losses”. This ensures that the desired sample size is achieved, even in the presence of non-response. Factors that reduce the...
