Blog Archives

Introduction to ORCID Researcher Identifiers in R with rorcid

This article provides a practical introduction to the rorcid package from ROpenSci to access the ORCID researcher ID API. ORCID stands for Open Researcher or Contributor ID (Haak et al. 2012; Meadows 2016; Youtie et al. 2017). ORCID is a non-profit organisation that provides researchers with a free unique researcher identifier and a profile. To date over 5 million...

Read more »

Separating and Trimming Messy Data the Tidy Way

When working with table data from the scientific or patent literature, it is extremely common to find that columns contain concatenated data. That is, they contain multiple entries with a semicolon as a separator. Data of this type is not tidy (Wickham 2014). What we commonly want to do is to separate the data out as the basis for...

Read more »

API Resources for the Scientific Literature in R and Python

This short post provides details on some of the main APIs (web services) that can be used to monitor and retrieve data from the scientific literature in either R or Python. We are using these packages and libraries as part of a GIZ supported project with the authorities in Kenya who are responsible for providing research permits. Kenya is...

Read more »

Creating A Bibliography with rcrossref

Creating A Bibliography with rcrossref

In this article we will look at how to create a bibliography using the rcrossref package in R with RStudio. To help us get started we will also look at how to reference any R packages that you use in your work with knitr. The article is part of work in progress for the WIPO Handbook on Patent Analytics....

Read more »

Exploring Geocoding Scientific Literature with R

Exploring Geocoding Scientific Literature with R

Getting Started Getting set up with the Google Maps API Using the API The Source Data Lookup the Records Using placement Using ggmap Using Googleway Reviewing Initial Results Tackling Abbreviations Lookup edited names Bringing the data together Assessing the Quality of Geocoding Preprocess the Data and Rerun the Query Duplicated Affiliation Names Quickly Mapping the Data Round Up References In this article we will explore geocoding using the Google Maps API and the placement, ggmap, and googleway...

Read more »

Counting Patent First Filings the Tidy Way with R

Counting Patent First Filings the Tidy Way with R

This article provides an in depth introduction to counting patent first filings or priority counts. It is a work in progress chapter for the WIPO Patent Analytics Handbook focusing on advanced patent analytics and builds on the introductory WIPO Manual on Open Source Patent Analytics. Counting first filings is an important subject for patent statistics because the first filing of...

Read more »

An Update on Importing Excel Data in R

February 5, 2018
By
An Update on Importing Excel Data in R

Introduction Import Directly from the RStudio Menu Reading an Excel file from a URL Tidying column names with janitor Exporting to Excel Round Up Introduction Back in 2015 I wrote a long blog post on importing Excel tables into R. Happily for everyone this is now a lot easier than it was. This post provides an update on importing spreadsheets into R and exporting from R...

Read more »

Dr. Evil meets the robotstxt package

I am fairly new to webscraping in R using rvest and one question is whether a site gives permission for scraping. This information is often contained in the robots.txt file on a website. So, I’m briefly going to explore the ROpenSci robotstxt package by Peter Meissner. robotstxt provides easy access to the robots.txt file for a domain from R. I’m...

Read more »

About

December 26, 2017
By

My name is Paul Oldham, I hold a PhD in Social Anthropology from the London School of Economics and I’m the lead at One World Analytics. My major interest is the conservation of biodiversity and the rights of indigenous peoples. For much of the 1990s and into the 2000s I worked with a people called the Piaroa in Amazonas...

Read more »

Creating an Infographic with infogram

Creating an Infographic with infogram

In this article we will use RStudio to prepare patent data for visualisation in an infographic using the online software tool infogram. Infographics are a popular way of presenting data in a way that is easy for a reader to understand without reading a long report. Infographics are well suited to presenting summaries of data with simple messages about key...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)