Moving into 2020, three things are clear - Organizations want Data Science, Cloud, and Apps. Here are the Top 5 essential skills for Data Scientists that need to build and deploy applications in 2020 and beyond.
This is part of a series of arti...
In this new post about the apply family functions, we’ll show some examples of three new functions: lapply(), sapply(), and vapply(). These functions generally works with objects of class list; however, other variants can work with vectors.
Suppose n... [Read more...]
Overview
Maps are apart of our everyday lives. They can be seen in almost every industry and are useful for numerous different purposes. A lot of the time these maps are static. Sometimes, however, it’s extremely useful to interact with the data on the map. This is where the ... [Read more...]
Jumping Rivers are coming to San Francisco in January 2020! We'll be running a number of R training courses with Paradigm Data. You can find the booking links and more details over at our courses page. Don't be afraid to get in contact if you have any questions! 22nd January - ...
I have decided to tackle this year’s Advent Of Code using R (more or less). I know there are more preferred languages, such as Python, C#, Java, JavaScript, Go, Kotlin, C++, Elixir, but it was worth trying. Into the 8th…Read more ›
Which States Impose the Most “Tax Pain?”
Much of the discussion around tax burdens focuses on income taxes but, at the state level, that leaves out two other big sources of tax liability, sales and property taxes. Here we’ll quickly look at the interplay of all three taxes in ...
I was wondering the realms of StackOver Flow answering some questions when I encoutered a question that looked to extract some parts of a string based on a regex. I thought I knew how to do this with the package stringr using, for example, str_sub but I found it ... [Read more...]
Solving Advent of Code 2019-08 with R and JavaScript.
[Disclaimer] Obviously, this post contains a big spoiler about Advent
of Code, as it gives solutions for solving day 8.
About the JavaScript code
The JavaScript code has been written in the same ...
Tidy evaluation is a framework for controlling how expressions and
variables in your code are evaluated by
tidyverse functions. This framework,
housed in the rlang package, is a powerful
tool for writing more efficient and elegant code. In particular, you’ll
find it useful for passing variable names as inputs ...
As of earlier this year, we are now automatically building binaries and pkgdown documentation for all rOpenSci packages. One issue we encountered is that some packages include vignettes that require some special tools/data/credentials, which are unavailable on generic build servers.
This post explains how to include such vignettes ...
Inset maps enable multiple places to be shown in the same geographic data visualisation, as described in the Inset maps section (8.2.7) of our open source book Geocomputation with R.
The topic of inset maps has gained attention and recently Enrico Spinielli asked inset maps could be created for data in ...
This is the methodological companion to my post on
my proposed method for evaluating NFL kickers. To get all the results and
some extra info about the data, check it out.
When you have a hammer, everything looks like a nail, right? Well I'm a big
fan of multilevel models ... [Read more...]
A digit puzzle as Le weekly Monde current mathematical puzzle that sounds close to some earlier versions: Perfect squares are pairs (a²,b²) with the same number of digits such that a²b² is itself a square. What is the pair providing a²b² less than 10⁶? Is there a solution ...
Our goal has been to make rquery the best query generation system for R (and to make data_algebra the best query generator for Python). Lets see what rquery is good at, and what new features are making rquery better. The idea is: the query is a first class citizen ... [Read more...]
A small Armadillo bugfix upstream update 9.800.3 came out a few days ago. The changes, summarised by Conrad in email to me (and for once not yet on the arma site are fixes for matrix row iterators, better detection of non-hermitian matrices by eig_s... [Read more...]
A new version, now at 0.2.1, of the random-number generator tester RDieHarder (based on the DieHarder suite developed / maintained by Robert Brown with contributions by David Bauer and myself) is now on CRAN.
This version has only internal changes. B... [Read more...]
Solving Advent of Code 2019-07 with R.
[Disclaimer] Obviously, this post contains a big spoiler about Advent
of Code, as it gives solutions for solving day 7.
Instructions
Find the instructions at: https://adventofcode.com/2019/day/7
R solution
ve... [Read more...]
Online appendices detailing the robustness of empirical analyses are
paramount but they never let readers explore all reasonable researcher
degrees of freedom. Simonsohn, Simmons and Nelson suggest
a ‘specification curve’ that allows readers to eyeball how a main coefficient
of interest varies across a wide arrange of specifications. I build ...
Creating a map in ggplot2 can be surprisingly easy! This tutorial will show the US by state. The dataset is from 1970 and will show some US statistics including population, income, life expectancy, and illiteracy. I love making maps, while predictive statistics provide such great insight, map making was one thing ...
Intro + hackathon description Have you ever participated in a hackathon? A month ago my answer was still no (despite the fact that I’ve been working in the R Shiny/Data Science space for 5 years). Now I have checked it off my list. And my team won, so I’m 1 ...