In this post, I present my new package {img2coord}. This package can be used to retrieve coordinates from a scatter plot (as an image). devtools::install_github("privefl/img2coord") Have you ever made a plot, saved it as a png and moved on? When you come back to ...
This is the final entry in a three-part series about the plumber package. The first post introduces plumber as an R package for building REST API endpoints in R. The second post builds a working example of a plumber API that powers a Slack slash command. In this final entry, ... [Read more...]
Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Cathy O'Neil, author of the blog mathbabe.org and several books on data science, including Weapons of Math Destruction.
Here is the podcast link.
Introducing Cathy O'Neil
Hugo: Hi there, Cathy, and welcome to DataFramed.
Cathy: Thank you. I'm ...
In a previous article I announced AzureR, a new family of packages for working with Azure from R. This article goes into more detail on how you can use AzureRMR, the base package of the AzureR family, to manage resources with Azure Resource Manager. Before you begin The first thing ...
The first time we at STATWORX are organizing a workshop for anyone who wants to learn more about data and graphics. Our “Data Visualization in R and ggplot2” workshop is designed for R-beginners and provides the perfect mixture between theory and practice. The workshop will be held at Mindspace in ...
Hi there!
Our manuscript about rWind R package has been recently accepted for publication in Ecography! As you know, rWind is a tool used to download and manage wind data, with some utilities that make easy to include wind information in ecological or evolutionary analyses (or others!).
Though there are ...
The new 0.4.6 release of RQuantLib arrived on CRAN and Debian earlier today. It is two-fold update: catching up QuantLib 1.14 while also updating to Boost 1.67 (and newer).
A special thanks goes to Josh for updating to the binary windows library in t... [Read more...]
I’ve released pqR-2018-11-18, a new version of my variant implementation of R. You can install it on Linux, Windows, or Mac as described at pqR-project.org. Installation must currently be from source, similarly to source installs of R Core versions of R. This version has some major ...
In my last Statistics Sunday post, I briefly mentioned the concept of regular expressions, also known as regex (though note that in some contexts, these refer to different things - see here). A regular expression is a text string, which you ask your program to match. You can use this ...
Nate Silver and the FiveThirtyEight folks talked quite a bit this election cycle about how their model was probabilistic. What this means is that they don’t just offer a single prediction (e.g., “Democrats will pick up 10 seats”); instead, their predic... [Read more...]
In the post (https://statcompute.wordpress.com/2018/11/23/more-robust-monotonic-binning-based-on-isotonic-regression), a more robust version of monotonic binning based on the isotonic regression was introduced. Nonetheless, due to the loss of granularity, the predictability has been somewhat compromised, which is a typical dilemma in the data science. On one hand, we don’t ... [Read more...]
We already saw the power of the OneR package in the preceding post. Here we want to give some more examples to gain some fascinating, often counter-intuitive, insights. Shirin Glander of Muenster University tested the OneR package with data from the World Happiness Report to find out what makes people ... [Read more...]
The Game, the Rules
Estimating the Average Game Length
Estimating the Rate of Goals Scored
Simulating the Games
R Setup
Equal Teams
Unequal (but matched) Teams
Conclusion
Much has been said about how the game of Quidditch is ruined by the scoring system - specifically how it makes no sense ... [Read more...]
The Game, the Rules
Estimating the Average Game Length
Estimating the Rate of Goals Scored
Simulating the Games
R Setup
Equal Teams
Unequal (but matched) Teams
Conclusion
Much has been said about how the game of Quidditch is ruined by the scoring system - specifically how it makes no sense ... [Read more...]
The Game, the Rules
Estimating the Average Game Length
Estimating the Rate of Goals Scored
Simulating the Games
R Setup
Equal Teams
Unequal (but matched) Teams
Conclusion
Much has been said about how the game of Quidditch is ruined by the scoring system - specifically how it makes no sense ... [Read more...]
In a recent post, I have discussed performance measures for model selection. This time, I write about a related topic: performance measures that are suitable for selecting models when performing feature selection. Since feature selection is concerned with reducing the number of dependent variables, suitable performance measures evaluate the trade-off ... [Read more...]
The awesome blog post by Tyler Morgan-Wall on 3d printing maps with his rayshader package rekindled an old desire of mine: Sometimes I would like to touch data. I am a big fan of data visualization and being able to add a third dimension and this haptic feel to the ...
Introduction
This blog posts will use several packages from the
{tidymodels} collection of packages, namely
{recipes},
{rsample} and
{parsnip} to train a random forest the tidy way. I will
also use {mlrMBO} to tune the hyper-parameters of the random forest.
Set up
Let’s load the needed packages:
I have completed the polishing/correcting/fiddling of the eight statistical analysis related chapters of my evidence-based software engineering book, and an updated draft pdf is now available (download here). The material was in much better shape than I recalled, after abandoning it to the world 2-years ago, to work ... [Read more...]
Much has been said about how the game of Quidditch is ruined by the scoring system - specifically how it makes no sense that the snitch is worth 150 points and that catching it ends the game [1, 2, 3]. Most of these arguments seem to revolve around the claim that it is nearly ...