Making a Profit with Henry Wan in Arkham Horror: The Card Game

December 3, 2018
By
Making a Profit with Henry Wan in Arkham Horror: The Card Game

Introduction The Forgotten Age cycle of Arkham Horror is at a close and Fantasy Flight Games already announced the next cycle, The Circle Undone. Not only that, they’ve announced two mythos packs at a rate that… surprised me. A new cycle announcement and two mythos pack announcements in less than two months? Am I the…Read more Making a Profit...

Read more »

AzureVM: managing virtual machines in Azure

December 3, 2018
By

This is the next article in my series on AzureR, a family of packages for working with Azure in R. I’ll give a short introduction on how to use AzureVM to manage Azure virtual machines, and in particular Data Science Virtual Machines (DSVMs). Creating a VM Creating a VM is as simple as using the create_vm method, which is...

Read more »

Automated Dashboard with Visualization and Regression for Healthcare Data

December 3, 2018
By
Automated Dashboard with Visualization and Regression for Healthcare Data

CategoriesProgramming Tags Data Visualisation Linear Regression R Programming Tips & Tricks In this article, you learn how to make a Automated Dashboard with Visualization and Regression for Healthcare Data in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. I use the healthcare insurance data from...

Read more »

Day 03 – little helper multiplot

December 3, 2018
By
Day 03 – little helper multiplot

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

An Utility Function For Monotonic Binning

December 2, 2018
By

In all monotonic algorithms that I posted before, I heavily relied on the smbinning::smbinning.custom() function contributed by Herman Jopia as the utility function generating the binning output and therefore feel deeply indebted to his excellent work. However, the availability of smbinning::smbinning.custom() function shouldn’t become my excuse for being lazy. Over the weekend, I drafted a

Read more »

Introduction to chartbookR

Introduction to chartbookR

”“Data scientists … spend from 50 percent to 80 percent of their time … collecting and preparing unruly digital data, before it can be explored for useful nuggets.” (— New York Times). The chartbookR package allows the convenient creation of economic and financial data chartbooks. It handles most of the data wrangling and can thus create large chartbooks with few...

Read more »

Statistics in Glaucoma: Part I

December 2, 2018
By
Statistics in Glaucoma: Part I

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Introduction Glaucoma is a leading cause of blindness worldwide, with a prevalence of 4% in the population aged 40-80. The disease is characterized by retinal ganglion cell death and...

Read more »

One Recipe Step to Rule Them All

In this post I will demonstrate, how my new R package customsteps can be used to create recipe steps, that apply custom transformations to a data set. Note, you should already be fairly familiar with the recipes package before you continue reading this post or give customsteps a spin! Recommended music for this reading session: Introducing the customsteps package Along with the recipes...

Read more »

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

December 2, 2018
By
Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

On 28 November 2018 I presented a poster at Dutch Demography Day in Utrecht. Here it is:

Read more »

restez: Query GenBank locally

restez: Query GenBank locally

What is restez? R packages for interacting with the National Center for Biotechnology Information (NCBI) have, to-date, depended on API query calls via NCBI’s Entrez. For computational analyses that require the automated look-up of reams of biological sequence data, piecemeal querying via bandwith-limited requests is evidently not ideal. These queries are not only slow, but they depend on network connections and...

Read more »

Install and Load Multiple R Packages

December 2, 2018
By

In enterprise environment, we generally need to automate the process of installing multiple R packages so that user does not have to install them separately before submitting your program. The function below performs the following operations - First it finds all the already installed R packages Check packages which we want to install are already installed or not. If package is already installed,...

Read more »

2018-12 MetaPost Three Ways

December 2, 2018
By

This report describes three different approaches to communicating between R and MetaPost: importing the PostScript output from MetaPost with the ‘grImport’ package; calling the mpost ...

Read more »

Why R for data science – and not Python?

December 2, 2018
By

There are literally hundreds of programming languages out there, e.g. the whole alphabet of one letter programming languages is taken. In the area of data science there are two big contenders: R and Python. Now why is this blog about R and not Python? I have to make a confession: I really wanted to like … Continue reading "Why...

Read more »

December Reading for Econometricians

December 2, 2018
By

My suggestions for papers to read during December:Askanazi, R., F. X. Diebold, F. Schorfheide, & M. Shin, 2018. On the comparison of interval forecasts. PIER Working Paper 18-013, Penn. Institute for Economic Research, University of Pennsylvania.Me...

Read more »

How to get the homology of a antibody using R

December 2, 2018
By
How to get the homology of a antibody using R

What is homology? Proteins are conserved bio molecules present in all organisms. Some proteins are conserved in many similar species, making them homologues of each other, meaning that the sequence is not the same but there is a degree of similarity. This is measured as homology in percentage. Histone H1 homology in mammals (https://en.wikipedia.org/wiki/Sequence_homology) What … Continue reading How...

Read more »

Day 02 – little helper na_omitlist

December 2, 2018
By
Day 02 – little helper na_omitlist

We at STATWORX work a lot with R and we often use the same little helper function within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

December 1, 2018
By

This is 3rd part of series about working with Magento 2 REST API in R. If you haven’t read previous posts in this series, I would recommend to do it. This article sample use the functions defined in previous posts. You may find them at R plus Magento 2 REST API… The post R plus Magento 2 REST API revisited:...

Read more »

TSstudio 0.1.3

December 1, 2018
By

I used the Thanksgiving break to push a new update of the TSstudio package to CRAN (version 0.1.3). The new version includes an update for the ts_backtesting function along with two new function - ts_to_prophet for converting time series objects to a prophet input format (i.e., ds and y columns), and ccf_plot for lags plot between two time series....

Read more »

Solving #AdventOfCode day 1 and 2 with R

December 1, 2018
By

Solving the first four puzzles of Advent of Code with R.

Read more »

What hyper-parameters are, and what to do with them; an illustration with ridge regression

What hyper-parameters are, and what to do with them; an illustration with ridge regression

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 7, which deals with statistical models. In the text below, I explain what hyper-parameters are, and as an example I run a ridge regression using the {glmnet} package. The book is still being written, so comments are...

Read more »

NYC buses: C5.0 classification with R; more than 20 minute delay?

December 1, 2018
By
NYC buses: C5.0 classification with R; more than 20 minute delay?

CategoriesAdvanced Modeling Tags Data Management Data Visualisation R Programming We are continuing on with our NYC bus breakdown problem. When we left off, we had constructed a rule-based Cubist regression model with our expanded pool of predictors; but we were still only managing to explain 37% of the data's variance with our model. Given how 'dirty' the target variable 'time_delayed' is (because it is...

Read more »

Using R: the best thing I’ve changed about my code in years

December 1, 2018
By

Hopefully, one’s coding habits are constantly improving. If you feel any doubt about yourself, I suggest looking back at something you wrote 2011. One thing I’ve changed recently that made my life so much better is a simple silly thing: meaningful name for index and counter variables. Take a look at these pieces of fake

Read more »

Day 01 – little helper checkdir

December 1, 2018
By
Day 01 – little helper checkdir

We at STATWORX work a lot with R and we often use the same little helper function within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

Simulating dinosaur populations, with R

November 30, 2018
By

So it turns out that the 1990 Michael Crichton novel Jurassic Park is, indeed, a work of fiction. (Personal note: despite the snark to follow, the book is one of my all-time favorites — I clearly remember devouring it in 24 hours straight while ill in a hostel in France.) If the monsters and melodrama didn't give it away,...

Read more »

NYC buses: Cubist regression with more predictors

November 30, 2018
By
NYC buses: Cubist regression with more predictors

CategoriesAdvanced Modeling Tags Data Management Linear Regression R Programming We have previously added a set of company identity-agnostic predictors, such as the number of drivers a company employs, or the number of vehicles in the fleet with a hydraulic lift, and so on. we took this approach, rather than having each company as a unique predictor, so that the addition of a new contractor...

Read more »

Number of births in the twentieth century by @ellis2013nz

November 30, 2018
By
Number of births in the twentieth century by @ellis2013nz

Motivation A couple of weeks back, Branko Milanovic asked on Twitter : “Does anyone know a link to a calculation on how many people were born … in the entire 20th century?” Somewhat surprisingly, no-one did. However, there was a calculatio...

Read more »

Faster garbage collection in pqR

November 29, 2018
By
Faster garbage collection in pqR

The latest version of pqR and the version before as well use a new garbage collector, and new memory layouts for R objects, which both reduce memory usage and considerably speed up garbage collection. Here, I’ll give an overview of how the new scheme works, and present some performance comparisons with R-3.5.1.  Some more details are presented

Read more »

Creating and saving multiple plots to Powerpoint

November 29, 2018
By

At the NHS R conference we delivered a session on animating patient flow. This started with a single plot showing all patient movements, and then I demonstrated the ability to create a faceted plot. But, with many different areas, and a smal...

Read more »

Using taxa and metacoder to explore taxonomy of Vancouver’s trees

November 29, 2018
By
Using taxa and metacoder to explore taxonomy of Vancouver’s trees

From the last post, Vancouver has several common genus in its collection, such as Prunus and Acer. So rather than analyzing the diversity of Vancouver tree species on a species level, we could, with the help of some R packages, visualize the variety of taxonomic groups. First, we format the Vancouver tree dataset similar to before to get the proper...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



wiley.com/learn/datascience

datasciencego.com

Quantide: statistical consulting and training

ODSC boston

datasociety

http://www.eoda.de









Six Sigma Online Training

mljar.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.