Monthly Archives: April 2019

On Meeting Data Journalists

April 7, 2019
By
On Meeting Data Journalists

“I’d rather do data than date”. I overheard this while eavesdropping on a conversation among three female data journalists while waiting for an elevator at the IRE-CAR (Investigative Reporters and Editors - Computer-Assisted Reporting) conference last month. I would like to think the remark was overloaded with hyperbole, but maybe not. Most of the attendees as this conference were...

Read more »

Rblpapi 0.3.10: B-PIPE support and other updates

April 7, 2019
By

A new version of Rblpapi arrived at CRAN today as version 0.3.10. It regroups several sets of changes (see below). The most notable addition is an extension to use B-PIPE for high-performance bulk data access (in case you are provisioned for it) cont...

Read more »

A Limited-but-Functional Couchbase Free Text Search & Retrieval Un-package; or, “How I Abused Couchbase & R to Perform Bulk IP Whois Full-text Searches” (a Cobbler’s Tale)

April 7, 2019
By
A Limited-but-Functional Couchbase Free Text Search & Retrieval Un-package; or, “How I Abused Couchbase & R to Perform Bulk IP Whois Full-text Searches” (a Cobbler’s Tale)

Researching “the internet” (i.e. $DAYJOB) means having to deal with a ton of “unique” (I’m being kind) data formats. This is ultimately a tale of how I performed full-text searches across one of them. It all started off innocently enough. This past week I need to be able to do full-text searches across metadata about... Continue reading →

Read more »

How many downloads does my package have?

April 7, 2019
By
How many downloads does my package have?

Everyone that authors an R package is curious about how many users download it. As far as I know there’s still no way to get information on all the downloads, from all the R mirrors. Here I’m using package cranlogs, which only gives information on the downloads from the R Studio mirror. It also does … Continue reading How...

Read more »

Writing R documentation, simplified

April 7, 2019
By
Writing R documentation, simplified

A massive part of statistical software development is the documentation. Good documentation is more than just a help file, it serves as commentary on how the software works, includes use cases, and cites any relevant sources. One cool thing about R documentation is that it uses a system that allows it to be put into a variety of...

Read more »

How to install R on Mac, Ubuntu and Windows

April 6, 2019
By
How to install R on Mac, Ubuntu and Windows

Original post published outside my blog on May 17th, 2018. I decided to edit the videos with kdenlive again and create both a new post and video. Motivation This is a beginner guide that is designed to save yourself a headache and valuable time if you decide to install R yourself. I am a professor and I don’t like to spend one...

Read more »

How to install R on Mac, Ubuntu and Windows

April 6, 2019
By
How to install R on Mac, Ubuntu and Windows

Original post published outside my blog on May 17th, 2018. I decided to edit the videos with kdenlive again and create both a new post and video. Motivation This is a beginner guide that is designed to save yourself a headache and valuable time if you decide to install R yourself. I am a professor and I don’t like to spend one...

Read more »

Historical newspaper scraping with {tesseract} and R

Historical newspaper scraping with {tesseract} and R

I have been playing around with historical newspapers data for some months now. The “obvious” type of analysis to do is NLP, but there is also a lot of numerical data inside historical newspapers. For instance, you can find these tables that show the market prices of the day in the L’Indépendance Luxembourgeoise: I wanted to see how easy it was to...

Read more »

Not Always C++’s Fault

April 6, 2019
By

From the recent developer.r-project.org “Staged Install” article: Incidentally, there were just two distinct (very long) lists of methods in the warnings across all installed packages in my run, but repeated for many packages. It turned out that they were lists of exported methods from dplyr and rlang packages. These two packages take very long to … Continue reading Not...

Read more »

Rborist 0-1.17 on CRAN

April 6, 2019
By

Version 0-1.17 of the Rborist package is building successfully on CRAN. This is the last of the 0-1 series; future versions will instead employ a more conventional API numbering scheme. Two new command-line options have been introduced. Option nThread permits limiting the number of OpenMP threads employed in parallel regions of code. Option "oob" constrains prediction to out-of-bag samples, facilitating the computation of permutation-based variable...

Read more »

Search R-bloggers

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)