Mapping Covid-19 cases: a Shiny app

April 4, 2020
By
Mapping Covid-19 cases: a Shiny app

R lets you create charts and graphs in image form. But the Shiny package lets you create those same charts and graphs in interactive format. I created my first Shiny chart: a world map of confirmed Covid-19 cases. Check it out here. Unfortunately I cannot embed the app into this website right now, so the … Continue reading Mapping...

Read more »

The significance of population size, year, age and per cent women on the education level in Sweden

The significance of population size, year, age and per cent women on the education level in Sweden

In my last post, I analysed how the level of education is affected by region, sex and year. In this post, I will continue with the same dataset but this time I will include age in the analysis. Please send suggestions for improvement of the analysis to [email protected] First, define libraries and functions. library (tidyverse) ## -- Attaching packages -------------------------------------------------- tidyverse 1.3.0...

Read more »

SR2 Chapter 3 Hard

April 4, 2020
By
SR2 Chapter 3 Hard

SR2 Chapter 3 Hard Posted on 5 April, 2020 by Brian Tags: statistical rethinking, solutions, grid approximation, posterior predictive check, posterior predictive distribution, map, binomial, hpdi Category: statistical-rethinking-2 Here’s my solutions to the...

Read more »

SR2 Chapter 3 Medium

April 4, 2020
By
SR2 Chapter 3 Medium

SR2 Chapter 3 Medium Posted on 5 April, 2020 by Brian Tags: statistical rethinking, solutions, grid approximation, posterior probability, posterior predictive probability, hpdi, binomial Category: statistical-rethinking-2 Here’s my solution to the medium exercises...

Read more »

On the “correlation” between a continuous and a categorical variable

April 4, 2020
By
On the “correlation” between a continuous and a categorical variable

Let us get back on the Titanic dataset, loc_fichier = "http://freakonometrics.free.fr/titanic.RData" download.file(loc_fichier, "titanic.RData") load("titanic.RData") base = base On consider two variables, the age (the continuous one) and the survivor indicator (the qualitative one) X = base$Age Y = base$Survived It looks like the age might be a valid explanatory variable in the logistic regression, summary(glm(Survived~Age,data=base,family=binomial)) Coefficients: Estimate Std. Error...

Read more »

D is for dummy_cols

April 4, 2020
By
D is for dummy_cols

For the letter D, I'm going to talk about the dummy_cols functions, which isn't actually part of the tidyverse, but hey: my posts, my rules. This function is incredibly useful for creating dummy variables, which are used in a variety of ways, including...

Read more »

Caching in R

April 4, 2020
By

Introduction Caching intermediate objects in R can be an efficient way to avoid re-evaluating long-running computations. The general process is always the same: run the chunk of code once, store the output to disk, and load it up the next time the same chunk is run. There are, of course, multiple packages in R to help with this process, so I’ve decided to outline some...

Read more »

Another “flatten the COVID-19 curve” simulation… in R

April 3, 2020
By
Another “flatten the COVID-19 curve” simulation… in R

 Hi there,This is the best meme I've found during these days... Well, here it is my "BUT" contribution. Some weeks ago The Washington Post published this simulations about how "social distancing" could help to "flat the curve" of COVID-19 infections. I fell in love with these simulations because their simplicity and explanatory power. Indeed, you can use the pretty similar...

Read more »

GARCHery

April 3, 2020
By
GARCHery

In our last post, we discussed using the historical average return as one method for setting capital market expectations prior to constructing a satisfactory portfolio. We glossed over setting expectations for future volatility, mainly because it is such a thorny issue. However, we read an excellent tutorial on GARCH models that inspired us at least to take a stab...

Read more »

patientcounter

April 3, 2020
By
patientcounter

fast, flexible census tables - Counting patients is hard Blimey. It’s over 3 years since I collaborated with Neil Pettinger on some plots to demonstrate patient flow, using R. What started out as a supposed quick blog post morphed into sev...

Read more »

Why R? Webinar on R/exams for E-Learning Quizzes and Beyond

April 3, 2020
By
Why R? Webinar on R/exams for E-Learning Quizzes and Beyond

Webinar hosted by Why R? Foundation: Slides, video, and e-learning resources. Why R? Webinar 001 R/exams was presented in the first Why R? webinar by Achim Zeileis. Marcin Kosiński was the host and...

Read more »

COVID-19 shiny / plotly dashboard

April 3, 2020
By
COVID-19 shiny / plotly dashboard

Governments and COVID-19: Which one stops it faster, better, has fewer people dying? These questions get answered with my dashboard. A contribution to the shiny-contest: https://community.rstudio.com/t/material-design-corona-covid-19-dashboard-2020-shiny-contest-submission/59690 Intro How did Corona spread? Using the animation feature of R-shiny this can be easily tracked.COVID-19 is the major topic in all news channels. The place I live in is Munich, Germany. Within weeks...

Read more »

RcppSimdJson 0.0.4: Even Faster Upstream!

April 3, 2020
By

A new (upstream) simdjson release was announced by Daniel Lemire earlier this week, and my Twitter mentions have been running red-hot ever since as he was kind enough to tag me. Do look at that blog post, there is some impressive work in there. We wr...

Read more »

C is for coalesce

April 3, 2020
By
C is for coalesce

For the letter C, we'll talk about the coalesce function. If you're familiar with SQL, you may have seen this function before. It combines two or more variables into a single column, and is a way to deal with missing data. When you give it a list of va...

Read more »

Introductory videos for Explanatory Model Analysis with R

April 3, 2020
By
Introductory videos for Explanatory Model Analysis with R

Remote teaching at my university encouraged me to prepare some video materials for Explanatory Model Analysis techniques, i.e. techniques of exploration, explanation and visualisation of predictive models.The pyramid for Explanatory Model Analysis. Lef...

Read more »

Custom Power BI visual for Line chart with two Y-Axis

April 3, 2020
By
Custom Power BI visual for Line chart with two Y-Axis

Power BI support certain type of visuals that are by default available in the document. These are absolutely great and work perfectly fine, have a lot of capabilities to set properties and change the settings. But every so often in…Read more ›

Read more »

Exploring and Benchmarking Oxford Government Response Data

Exploring and Benchmarking Oxford Government Response Data

Exploring and Benchmarking Oxford Government Response Data Assessing the impact of Non-Pharmaceutical Interventions on the spread of Covid-19 requires data on Governmental measures. Luckily, the Assessment Capacities Project (ACAPS) and the Oxford Covid-19 Government Response Tracker both provide such data. In this blog post, I explore the new data provided by the Oxford initiative and compare it against the data...

Read more »

RQuantLib 0.4.12: Small QuantLib 1.18 update

April 2, 2020
By

A new release 0.4.12 of RQuantLib arrived on CRAN today, and was uploaded to Debian as well. QuantLib is a very comprehensice free/open-source library for quantitative finance; RQuantLib connects it to the R environment and language. This version doe...

Read more »

B is for bind_rows

April 2, 2020
By
B is for bind_rows

Moving on to the letter B, today we'll talk about merging datasets that contain the same variables but add new cases. This is easily done with bind_rows. Let's say I realized I forgot to log some of the books I read last year, and I wanted to merge tho...

Read more »

poorman: First Release of a base R dplyr Clone

April 1, 2020
By
poorman: First Release of a base R dplyr Clone

Introduction The first official release of poorman (v 0.1.9) is now on CRAN! You can now install poorman directly from CRAN with the following code: install.packages("poorman") In this blog post I want to address some common questions that I have received since I started writing the package. What is poorman? poorman is a package that unapologetically attempts to recreate the dplyr API in a...

Read more »

RStudio Connect 1.8.2

April 1, 2020
By
RStudio Connect 1.8.2

A big update for our Python community One of the biggest frustrations for a data scientist, whether your primary language is R or Python, is to have your hard work go underutilized. A stream of disposable reports, emails, and presentations that get viewed once and cast aside are not the ideal recipe for how to make an impact. To combat...

Read more »

Bootstrap resampling with #TidyTuesday beer production data

April 1, 2020
By
Bootstrap resampling with #TidyTuesday beer production data

I’ve been publishing screencasts demonstrating how to use the tidymodels framework, from first steps in modeling to how to tune more complex models. Today, I’m using this week’s #TidyTuesday dataset on beer production to show how to use bootstrap resampling to estimate model parameters. Here is the code I used in the video, for those who prefer...

Read more »

COVID-19 decease animation map

April 1, 2020
By
COVID-19 decease animation map

Coronavirus decease in France# Animation carto décès COVID 19 France # avec lissage # sources ----------------------------------------------------------------- # https://www.data.gouv.fr/fr/datasets/donnees-hospitalieres-relatives-a-lepidemie-de-covid-19/ fichier_covid % group_by(dep) %__% summarise(deces = sum(dc, na.rm = TRUE), reanim = sum(rea, na.rm...

Read more »

AutoML Frameworks in R & Python

April 1, 2020
By
AutoML Frameworks in R & Python

In last few years, AutoML or automated machine learning as become widely popular among data science community. Big tech giants like Google, Amazon and Microsoft have started offering AutoML tools. There is still a split among data scientists when it… Read More AutoML Frameworks in R & Python

Read more »

{uaparserjs} Updated on CRAN & Using webpack to Make {V8} Application Bundles

April 1, 2020
By

Just a quick note that thanks to a gentle nudge an updated version of {uaparser} — a package that processes User Agent strings web clients send to servers — is making its way to all the CRAN mirrors and is also available on CINC. The most significant change is a much overdue update to the... Continue reading →

Read more »

Top Data Science Blogs

April 1, 2020
By

As a data scientist, I always seek to learn about out new tools and techniques. Although research papers are a great resource to learn, they are mostly either theoretical or lack in hands on explanation. Blogs are a great way… Read More Top Data Science Blogs

Read more »

A is for arrange

April 1, 2020
By
A is for arrange

The arrange function allows you to sort a dataset by one or more variable, either ascending or descending. This function is especially helpful if you plan on aggregating your data with summarize (which, we'll get to later), so you can select specific r...

Read more »

Making Of: A Free API For COVID-19 Data

April 1, 2020
By
Making Of: A Free API For COVID-19 Data

For all those, who are struggling with the (kind of weird) Johns Hopkins University COVID-19 case data CSV files, we've created a free API that makes it easy to integrate the latest worldwide COVID-19 data into your application. Der Beitrag Making Of: A Free API For COVID-19 Data erschien zuerst auf STATWORX.

Read more »

Crowdfight Covid-19 – Call for R-community action

April 1, 2020
By

Dear R-community, This is a rather unusual post for an R-blog, but unusual times may require that. I am not involved into organizing this effort except for volunteering in the same way I want you to do. So, if you aren’t aware of this initiative yet, please likewise consider subscribing to Crowdfight Covid-19. Motivation: Most current plans … Continue reading Crowdfight...

Read more »

Search R-bloggers

Sponsors