I read this tweet thread yesterday, and one of the great things in it was discovering that the Department for Transport release traffic count data for Great Britain. When you download individual observation… Continue reading →
The prrd package was introduced recently, and made it to CRAN shortly thereafter. The idea of prrd is simple, and described in some more detail on its webpage and its GitHub repo. Reverse dependency checks are an important part of package development...
If you tend to do lots of large Monte Carlo simulations, you've probably already discovered the benefits of multi-core CPUs and parallel computation. A simulation that takes 4 weeks without parallelization, can easily be done in 1 week on a quad core laptop with parallelization. However, for even larger simulations reducing the ... [Read more...]
With the 2018 FA Cup now in its fourth round, I asked myself: what is the importance of the trophy in relation to league position? If the primary goal for English (and Welsh) teams is succeeding in the league, does the FA Cup help teams with this objective? I decided for ...
Remember the nascent series of blog posts about Parks and recreation? Well, we’re still at one post, but don’t worry, here is a new one, and I’m sure the series will eventually be a real one. I’m looking at you, my R-Ladies friends. That said, today ...
I’ve been writing/talking a lot about LIME recently: in this blog/ at H20 meetup, or at coming AI Congress and I’m still sooo impressed by this tool for interpreting any, even black-box, algorithm! The part I love most is that LIME can be applied to both image ...
Remember the nascent series of blog posts about Parks and recreation? Well, we’re still at one post, but don’t worry, here is a new one, and I’m sure the series will eventually be a real one. I’m looking at you, my R-Ladies friends. That said, ...
I’m happy to announce a new package that has recently appeared on CRAN, called “TSrepr” (version 1.0.0: https://CRAN.R-project.org/package=TSrepr).
The TSrepr package contains methods of time series representations (dimensionality reduction, feature extraction or preprocessing) and several other useful helper methods and functions.
Time series representation can ...
This is part 3 of a 3 part blog post. This post uses the data that was scraped in part 1 and prepared in part 2.
Now that we have the data in a nice format, let’s make a frequency plot! First let’s load the data and the packages:
library("tidyverse")
library("ggthemes") # To use different themes and colors
renert_tokenized = readRDS("renert_tokenized.rds")
In a widely shared video, US Admiral McRaven addressing University of Texas at Austin's Class of 2014 chooses to deliver a simple message: make your bed every day.
A highlight of this talk is the quote The little things in life matter. If you can't do...
The keynotes for eRum 2018 are announced! Check them out and every other information about the international conference taking place in Budapest, this year! ---- The eRum conferences are particularly thought for the many Europeans that can’t manage to take part in the use!R conferences when they are based ... [Read more...]
I'm a big fan using R to simulate data. When I'm trying to understand a data set, my first step is sometimes to simulate data from a model and compare the results to the data, before I go down the path of fitting an analytical model directly. Simulations are easy ... [Read more...]
Last summer we discussed the simplified interface of the 1.0 CRAN release of healthcare.ai-R, and we’re now thrilled to demo new features related to clinician guidance in the 1.2 version. We’re calling this Patient Impact Predictor (PIP). Understanding an ML model This week we’d like to highlight new ... [Read more...]
If you're an organizer of an R-focused meetup group, or are planning a community-led R conference, the 2018 R Consortium R User Group Support Program is now accepting applications for sponsorship. The 2017 program funded 76 user groups and 3 small conferences, and the program is expanding further in 2018. User groups now also receive ... [Read more...]
"It turns out that style matters in programming for the same reason that it matters in writing. It makes for better reading.“ Douglas Crockford in JavaScript: The Good Parts Why do we need yet another style guide? "The reason to care about a style guide is just one thing: We ... [Read more...]
This blog post series is on machine learning with R. We will use the Caret package in R. In this part, we will first perform exploratory Data Analysis (EDA) on a real-world dataset, and then apply non-regularized linear regression to solve a supervised regression problem on the dataset. We will ...
Unfortunately this was not taught in any of my statistics or data analysis classes at university (wtf it so needs to be :scream_cat:).
So it took me some time until I learned that the AUC has a nice probabilistic meaning.
What’s AUC anyway?
AUC is the area under ... [Read more...]
Unfortunately this was not taught in any of my statistics or data analysis classes at university (wtf it so needs to be :scream_cat:).
So it took me some until I learned that the AUC has a nice probabilistic meaning.
What’s AUC anyway?
Consider:
A dataset : , where
is a ... [Read more...]
Subscribe to TheAutomatic.net via the area on the right side of the page. The yahoo_fin package contains functions to scrape stock-related data from Yahoo Finance and NASDAQ. You can view the official documentation by clicking this link, but the below post will provide a few more in-depth examples. ...