April 2020

Tabulizer and pdftools Together as Super-powers – Part 2

April 5, 2020 | R on Redwall Analytics

# Libraries
packages <- 
  c("data.table",
    "stringr",
    "rlist",
    "tabulizer",
    "pdftools",
    "parallel",
    "DT"
    )

if (length(setdiff(packages,rownames(installed.packages()))) > 0) {
  install.packages(setdiff(packages, rownames(installed.packages())))  
}

invisible(lapply(packages, library, character.only = TRUE))

knitr::opts_chunk$set(comment=NA, fig.width=12, fig.height=8, out.width = '100%')
Introduction This post will be a continuation of Parsing of Mass Municipal PDF CAFR’s with Tabulizer, pdftools and AWS Textract - Part 1 dealing with extracting data from PDFs using R. When Redwall discovered pdftools, and its pdf_data() function, which maps out every word on a pdf page ... [Read more...]

Comments on the COVID-19 model source code from Imperial

April 5, 2020 | Derek Jones

At the end of March a paper modelling the impact of various scenarios on the spread of COVID-19 infections, by the MRC Centre for Global Infectious Disease Analysis at Imperial College, appears to have influenced the policy of the powers that be. This group recently started publishing their modelling code ... [Read more...]

Comments on the COVID-19 model source code from Imperial

April 5, 2020 | Derek Jones

At the end of March a paper modelling the impact of various scenarios on the spread of COVID-19 infections, by the MRC Centre for Global Infectious Disease Analysis at Imperial College, appears to have influenced the policy of the powers that be. This group recently started publishing their modelling code ... [Read more...]

Audio classification in R

April 5, 2020 | Francisco de Abreu e Lima

Visiting Berlin in December 2013, some friends and I spent one evening at the White Trash, a rockabilly-vibe bar decorated with hanging skeletons and a couple of conspicuous portraits of Ian “Lemmy”, the late frontman of Motörhead. Incidentally, I remember trying to guess what song was playing at a certain ...
[Read more...]

Mapping Covid-19 cases: a Shiny app

April 4, 2020 | baskesh

R lets you create charts and graphs in image form. But the Shiny package lets you create those same charts and graphs in interactive format. I created my first Shiny chart: a world map of confirmed Covid-19 cases. Check it out here. Unfortunately I cannot embed the app into this ...
[Read more...]

SR2 Chapter 3 Hard

April 4, 2020 | Brian Callander

SR2 Chapter 3 Hard Posted on 5 April, 2020 by Brian Tags: statistical rethinking, solutions, grid approximation, posterior predictive check, posterior predictive distribution, map, binomial, hpdi Category: statistical-rethinking-2 Here’s my solutions to the hard exercises in chapter 3 of McElreath’s Statistical Rethinking, 2nd edition. \(\DeclareMathOperator{\dbinomial}{Binomial} \DeclareMathOperator{\dbernoulli}{Bernoulli} \DeclareMathOperator{\dpoisson}{... [Read more...]

SR2 Chapter 3 Medium

April 4, 2020 | Brian Callander

SR2 Chapter 3 Medium Posted on 5 April, 2020 by Brian Tags: statistical rethinking, solutions, grid approximation, posterior probability, posterior predictive probability, hpdi, binomial Category: statistical-rethinking-2 Here’s my solution to the medium exercises in chapter 3 of McElreath’s Statistical Rethinking, 2nd edition. \(\DeclareMathOperator{\dbinomial}{Binomial} \DeclareMathOperator{\dbernoulli}{Bernoulli} \DeclareMathOperator{\dpoisson}{Poisson} \DeclareMathOperator{\... [Read more...]

D is for dummy_cols

April 4, 2020 | Unknown

For the letter D, I'm going to talk about the dummy_cols functions, which isn't actually part of the tidyverse, but hey: my posts, my rules. This function is incredibly useful for creating dummy variables, which are used in a variety of ways, including...
[Read more...]

Caching in R

April 4, 2020 | Posts | Joshua Cook

Introduction Caching intermediate objects in R can be an efficient way to avoid re-evaluating long-running computations. The general process is always the same: run the chunk of code once, store the output to disk, and load it up the next time the same chunk is run. There are, of course, ... [Read more...]

Caching in R

April 4, 2020 | Posts | Joshua Cook

Introduction Caching intermediate objects in R can be an efficient way to avoid re-evaluating long-running computations. The general process is always the same: run the chunk of code once, store the output to disk, and load it up the next time the same chunk is run. There are, of course, ... [Read more...]

patientcounter

April 3, 2020 | HighlandR

Tracking patient movements is difficult It’s over 3 years since I collaborated with Neil Pettinger on some plots to demonstrate patient flow, using R. What started out as a supposed quick blog post morphed into several weeks of work, a blog ...
[Read more...]

GARCHery

April 3, 2020 | R on OSM

In our last post, we discussed using the historical average return as one method for setting capital market expectations prior to constructing a satisfactory portfolio. We glossed over setting expectations for future volatility, mainly because it is such a thorny issue. However, we read an excellent tutorial on GARCH models ...
[Read more...]

patientcounter

April 3, 2020 | HighlandR

fast, flexible census tables - Counting patients is hard Blimey. It’s over 3 years since I collaborated with Neil Pettinger on some plots to demonstrate patient flow, using R. What started out as a supposed quick blog post morphed into sev...
[Read more...]
1 13 14 15 16 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)