Blog Archives

Release mongolite 1.0

March 13, 2017
By
Release mongolite 1.0

After 2.5 years of development, version 1.0 of the mongolite package has been released to CRAN. The package is now stable, well documented, and will soon be submitted for peer review to be onboarded in the rOpenSci suite. MongoDB in R and mongolite I started working on mongolite in September 2014, and it was first announced at the rOpenSci

Read more »

Using xml schema and xslt in R

January 10, 2017
By

This week an update for xml2 and a new xslt package have appeared on CRAN. A full announcement for xml2 version 1.1 will appear on the rstudio blog. This post explains xml validation (via xsd schema) and xml transformation (via xslt stylesheets) which have been added in this release. XML schemas and stylesheets are not exactly new;...

Read more »

Update jsonlite 1.2

January 4, 2017
By

A new version of jsonlite package to CRAN. This is a maintenance release with enhancements and bug fixes. A summary of changes in v1.2 from the NEWS file: Add read_json and write_json convenience wrappers, #161 Update modp_numtoa from upstream, fixes a rounding issue in #148. Ensure asJSON.POSIXt does not use sci notation for negative values, #155 Tweak num_to_char...

Read more »

Announcing pdftools 1.0

December 9, 2016
By

This week we released version 1.0 of the ropensci pdftools package to CRAN. Pdftools provides utilities for extracting text, fonts, attachments and other data from PDF files. It also supports rendering of PDF files into bitmap images. This release has a few internal enhancements and fixes an annoying bug for landscape PDF pages. The version bump to 1.0 signifies...

Read more »

Tesseract Update: Options and Languages

December 8, 2016
By

A few weeks ago we announced the first release of the tesseract package: a high quality OCR engine in R. We have now released an update with extra features. Installing Training Data As explained in the first post, the tesseract system is powered by language specific training data. By default only English training data is installed. Version...

Read more »

High Performance CommonMark and Github Markdown Rendering in R

December 2, 2016
By
High Performance CommonMark and Github Markdown Rendering in R

This week the folks at Github have open sourced their fork of libcmark (based on the extensive PR by Mathieu Duponchelle), which they use to render markdown text within documents, issues, comments and anything else on the Github website. The new release of the commonmark R package incorporates this library so that we can take advantage...

Read more »

The new Tesseract package: High Quality OCR in R

November 16, 2016
By
The new Tesseract package: High Quality OCR in R

Optical character recognition (OCR) is the process of extracting written or typed text from images such as photos and scanned documents into machine-encoded text. The new rOpenSci package tesseract brings one of the best open-source OCR engines to R. This enables researchers or journalists, for example, to search and analyze vast numbers of documents that are only available...

Read more »

Encryption and Digital Signatures in R using GPG

October 19, 2016
By

A new package gpg has appeared on CRAN. From the package description: Bindings to GnuPG for working with OpenGPG (RFC4880) cryptographic methods. Includes utilities for public key encryption, creating and verifying digital signatures, and managing your local keyring. Note that some functionality depends on the version of GnuPG that is installed on the system. In particular GnuPG 2 mandates...

Read more »

New package graphql: A GraphQL Query Parser

October 5, 2016
By

The new ropensci graphql package is now on CRAN. It implements R bindings to the libgraphqlparser C++ library to parse GraphQL syntax and export the syntax tree in JSON format: graphql2json("{ field(complex: { a: { b: } }) }") A syntax parser is perhaps not super useful to most end-users, but...

Read more »

Hunspell 2.0: High-Performance Stemmer, Tokenizer, and Spell Checker for R

September 11, 2016
By
Hunspell 2.0: High-Performance Stemmer, Tokenizer, and Spell Checker for R

A new version of the ropensci hunspell package has been released to CRAN. Hunspell is the spell checker library used by LibreOffice, OpenOffice, Mozilla Firefox, Google Chrome, Mac OS-X, InDesign, Opera, RStudio and many others. It provides a system for tokenizing, stemming and spelling in almost any language or alphabet. The R package exposes both the high-level spell-checker...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)