Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

November 15, 2018

Introduction: In this blog post, I’ll use the data that I cleaned in a previous blog post, which you can download here. If you want to follow along, download the monthly data. In my last blog post I showed how to perform a grid search the “tidy” way. As an example, I looked for the right hyperparameters of a SARIMA model. However, the goal of...

Read more »

In case you missed it: October 2018 roundup

November 15, 2018

In case you missed them, here are some articles from October of particular interest to R users. Peter Provost ports some 80's-era BASIC programs for kids to R. In a podcast for Fringe FM, I discuss the ethics of AI, Microsoft and Open Source, and the R Community. Roundup of AI, Machine Learning and Data Science news from October...

Read more »

Quoting in R

November 15, 2018

Many R users appear to be big fans of "code capturing" or "non standard evaluation" (NSE) interfaces. In this note we will discuss quoting and non-quoting interfaces in R. The above terms are simply talking about interfaces where a name to be used is captured from the source code the user typed, and thus does … Continue reading Quoting...

Read more »

A deep dive into glmnet: standardize

November 15, 2018

I’m writing a series of posts on various function options of the glmnet function (from the package of the same name), hoping to give more detail and insight beyond R’s documentation. In this post, we will focus on the standardize … Continue reading →

Read more »

Gold-Mining Week 11 (2018)

November 15, 2018

Week 11 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining Week 11 (2018) appeared first on Fantasy Football Analytics.

Read more »

Make Beautiful Tables with the Formattable Package

November 15, 2018

I love the formattable package, but I always struggle to remember its syntax. A quick Google search reveals that I’m not alone in this struggle....

Read more »

Rcpp now used by 1500 CRAN packages

November 15, 2018

Right now Rcpp stands at 1500 reverse-dependencies on CRAN. The graph on the left depicts the growth of Rcpp usage (as measured by Depends, Imports and LinkingTo, but excluding Suggests) over time. What an amazing few days this has been as we als...

Read more »

Discourse Network Analysis: Undertaking Literature Reviews in R

November 14, 2018

Literature reviews are the cornerstone of science. Keeping abreast of developments within any given field of enquiry has become increasingly difficult given the enormous amount of new research. Databases and search technology have made finding relevant literature easy, but keeping a coherent overview of the discourse within a field of enquiry is an ever more … Continue reading "Discourse...

Read more »

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

November 14, 2018

Introduction: In this blog post, I’ll use the data that I cleaned in a previous blog post, which you can download here. If you want to follow along, download the monthly data. In the previous blog post, I used the auto.arima() function to very quickly get a “good-enough” model to predict future monthly total passengers flying from LuxAirport. “Good-enough” models can be all you need in...

Read more »

More on Bias Corrected Standard Deviation Estimates

November 14, 2018

This note is just a quick follow-up to our last note on correcting the bias in estimated standard deviations for binomial experiments. For normal deviates there is, of course, a well-known scaling correction that returns an unbiased estimate for observed standard deviations. It (from the same source): … provides an example where imposing the … Continue reading More...

Read more »

More Sandwiches, Anyone?

November 14, 2018

Consider this my Good Deed for the Day! A retweet from a colleague whom I follow on Twitter brought an important paper to my attention. I thought I'd share it more widely. The paper is titled "Small-sample methods for cluster-robust variance estimation and hypothesis testing in fixed effect models", by James Pustejovsky (@jepusto) and Beth Tipton (@stats-tipton). It appears in The Journal of Business and Economic...

Read more »

Use GitHub Vulnerability Alerts to Keep Users of Your R Packages Safe

November 14, 2018

Despite their now inherent evil status, GitHub has some tools other repository aggregators do not. One such tool is the free vulnerability alert service which will scan repositories for outdated+vulnerable dependencies. Now, “R” is nowhere near a first-class citizen in the internet writ large, including software development tooling (e.g. the Travis-CI and GitLab continuous integration... Continue reading →

Read more »

anytime 0.3.3

November 14, 2018

A new minor clean-up release of the anytime package arrived on CRAN overnight. This is the fourteenth release, and follows the 0.3.2 release a good week ago. anytime is a very focused package aiming to do just one thing really well: to convert anythi...

Read more »

Gazing into the Abyss of P-Hacking: HARKing vs. Optional Stopping

November 14, 2018

by Angelika Stefan & Felix Schönbrodt Almost all researchers have experienced the tingling feeling of suspense that arises right before they take a look at long-awaited data: Will they support their favored hypothesis? Will they yield interesting or even groundbreaking results? In a perfect world (especially one without publication bias), the cause of this suspense The post Gazing into...

Read more »

Windows Clipboard Access with R

November 13, 2018

The Windows clipboard is a quick way to get data in and out of R. How can we exploit this feature to accomplish our basic data exploration needs, and when might its use be inappropriate? Read on. Overview Paste Data from Clipboard to R HTML Tables Spreadsheets Copy Data from R to Clipboard Summary and Usage Notes Paste Data...

Read more »

R plus Magento 2 REST API revisited: part 2 – filtered search

November 13, 2018

In the previous post I wrote about how to set up a connection to the Magento 2 REST API from R and make queries to universal search. Now let’s see how to use filtered search. We’ll also use endpoints other than universal search that provide access to more specific Magento 2 store information. Using filtered… The post R plus Magento 2 REST API revisited:...

Read more »

A Mathematician’s Perspective on Topological Data Analysis and R

November 13, 2018

A few years ago, when I first became aware of Topological Data Analysis (TDA), I was really excited by the possibility that the elegant theorems of Algebraic Topology could provide some new insights into the practical problems of data analysis. But time has passed, and the sober assessment of Larry Wasserman seems to describe where things stand. TDA is an...

Read more »

Rdew Valley: Optimizing Farming with R

November 13, 2018

I recently picked up a copy of my favorite game Stardew Valley again. If you don’t know the game, I can highly recommend it! You inherit a pixel farm and you are in charge of everything. Crops, animals, fishing, mining and never forget to socialize. My plan was to shut off work for at least a few hours while playing. But at...

Read more »

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

November 13, 2018

In this blog post, I will show you how you can quickly and easily forecast a univariate time series. I am going to use data from the EU Open Data Portal on air passenger transport. You can find the data here. I downloaded the data in the TSV format for Luxembourg Airport, but you could repeat the analysis for any airport. Once you...

Read more »

Building a Repository of Alpine-based Docker Images for R, Part II

November 13, 2018

In the first article of this series, I built an Alpine-based Docker image with R base packages from Alpine’s native repositories, as well as one image with R compiled from source code. The images are hosted on Docker Hub in the velaco/alpine-r repository...

Read more »

AI for Good: slides and notebooks from the ODSC workshop

November 13, 2018

Last week at the ODSC West conference, I was thrilled with the interest in my Using AI for Good workshop: it was wonderful to find a room full of data scientists eager to learn how data science and artificial intelligence can be used to help people and the planet. The workshop was focused around projects from the Microsoft AI...

Read more »

Those “other” apply functions…

November 13, 2018

So you know lapply, sapply, and apply…but…what about rapply, vapply, or eapply? These are generally a little less known as far as the apply family of functions in R go, so this post will explore how they work. rapply Let’s start with rapply. This function has a couple of different purposes. One is to recursively The post Those “other”...
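As a quick taste of the three functions named in this teaser, here is a minimal base-R sketch. The data and variable names (`x`, `nested`, `env`) are made up for illustration and are not taken from the linked post.

```r
# vapply: like sapply, but the type and length of each result are declared
x <- list(a = c(1, 2, 3), b = c(4, 5, 6))
sums <- vapply(x, sum, FUN.VALUE = numeric(1))

# rapply: apply a function recursively to the numeric leaves of a nested
# list; how = "replace" keeps the list structure and leaves other leaves alone
nested <- list(a = 1, b = list(c = 2, d = "text"))
doubled <- rapply(nested, function(v) v * 2, classes = "numeric", how = "replace")

# eapply: apply a function to every object stored in an environment
env <- new.env()
assign("u", 1:10, envir = env)
assign("v", 11:20, envir = env)
means <- eapply(env, mean)
```

Note that `eapply` returns a named list whose order is not guaranteed, so it is safer to access results by name (for example `means$u`) than by position.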

Read more »

Introducing vizscorer: a bot advisor to score and improve your ggplot plots

November 13, 2018

One of the most frustrating issues I face in my professional life is the plenitude of ineffective reports generated within my company. Wherever I look, there are plenty of junk charts, like bar plots showing useless 3D effects or ambiguous and crowded pie charts. I do understand the root causes of this desperate state of...

Read more »

A deep dive into glmnet: penalty.factor

November 13, 2018

The glmnet function (from the package of the same name) is probably the most used function for fitting the elastic net model in R. (It also fits the lasso and ridge regression, since they are special cases of elastic net.) … Continue reading →

Read more »

Installing RStudio & Shiny Servers

November 12, 2018

I did a remote install of Ubuntu Server today. This was somewhat novel because it’s the first time that I have not had physical access to the machine I was installing on. The server install went very smoothly indeed. The next tasks were to install RStudio Server and Shiny Server. The installation process for each of these is well documented...

Read more »

Interchanging RMarkdown and “spinnable” R

November 12, 2018

Dean Attali wrote this nice post a few years ago describing knitr’s spin function. This function allows a regular R file, with comments written using the roxygen2-style comment tag #', to be rendered as an HTML document, with the comments rendered as text and the results of the R code rendered in place, much as an RMarkdown document would be. The basic...

Read more »

Shiny 1.2.0: Plot caching

November 12, 2018

We’re pleased to announce the CRAN release of Shiny v1.2.0! This release features Plot Caching, an important new tool for improving performance and scalability in Shiny apps. If you’re not familiar with the term “caching”, it just means that when we perform a time-consuming operation, we save (cache) the results so that the next time that operation is requested, we...
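The general idea of caching described in this teaser can be sketched in a few lines of base R. This is only an illustration of the concept, not Shiny's plot caching mechanism; `make_cached` and `slow_square` are hypothetical names introduced here.

```r
# Hypothetical helper: wrap a function so that repeated calls with the same
# argument reuse the stored result instead of recomputing it.
make_cached <- function(f) {
  cache <- new.env()
  function(x) {
    key <- as.character(x)
    if (!exists(key, envir = cache, inherits = FALSE)) {
      assign(key, f(x), envir = cache)  # first request: compute and store
    }
    get(key, envir = cache, inherits = FALSE)  # later requests: look up
  }
}

calls <- 0
slow_square <- function(x) {
  calls <<- calls + 1  # count how often the real computation runs
  x^2                  # stands in for a time-consuming operation
}

cached_square <- make_cached(slow_square)
first <- cached_square(4)   # computed
second <- cached_square(4)  # served from the cache
```

After both calls, `calls` is 1: the second request never reaches `slow_square`. A real cache also needs a rule for when stored results become stale, which this sketch leaves out.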

Read more »

TWIMLAI European Online Meetup about Trust in Predictions of ML Models

November 12, 2018

At the upcoming This Week in Machine Learning and AI European online Meetup, I’ll be presenting and leading a discussion about the Anchors paper, the next generation of machine learning interpretability tools. Come and join the fun! :-) Date: Tuesda...

Read more »

The Antarctic/Southern Ocean rOpenSci community

Antarctic/Southern Ocean science and rOpenSci: Collaboration and reproducibility are fundamental to Antarctic and Southern Ocean science, and the value of data to Antarctic science has long been promoted. The Antarctic Treaty (which came into force in 1...

Read more »
