Detecting spatiotemporal groups in relocation data with spatsoc

Detecting spatiotemporal groups in relocation data with spatsoc

spatsoc is an R package written by Alec Robitaille, Quinn Webber and Eric Vander Wal of the Wildlife Evolutionary Ecology Lab (WEEL) at Memorial University of Newfoundland. It is the lab’s first R package and was recently accepted through the rOpenSci onboarding process with a big thanks to reviewers Priscilla Minotti and Filipe Teixeira, and editor Lincoln Mullen. spatsoc started...

Read more »

GARCH and a rudimentary application to Vol Trading

December 3, 2018
By
GARCH and a rudimentary application to Vol Trading

This post will review Kris Boudt’s datacamp course, along with introducing some concepts from it, discuss GARCH, present an application … Continue reading →

Read more »

Very Non-Standard Calling in R

December 3, 2018
By

Our group has done a lot of work with non-standard calling conventions in R. Our tools work hard to eliminate non-standard calling (as is the purpose of wrapr::let()), or at least make it cleaner and more controllable (as is done in the wrapr dot pipe). And even so, we still get surprised by some of … Continue reading Very...

Read more »

Making a Profit with Henry Wan in Arkham Horror: The Card Game

December 3, 2018
By
Making a Profit with Henry Wan in Arkham Horror: The Card Game

Introduction The Forgotten Age cycle of Arkham Horror is at a close and Fantasy Flight Games already announced the next cycle, The Circle Undone. Not only that, they’ve announced two mythos packs at a rate that… surprised me. A new cycle announcement and two mythos pack announcements in less than two months? Am I the…Read more Making a Profit...

Read more »

AzureVM: managing virtual machines in Azure

December 3, 2018
By

This is the next article in my series on AzureR, a family of packages for working with Azure in R. I’ll give a short introduction on how to use AzureVM to manage Azure virtual machines, and in particular Data Science Virtual Machines (DSVMs). Creating a VM Creating a VM is as simple as using the create_vm method, which is...

Read more »

Automated Dashboard with Visualization and Regression for Healthcare Data

December 3, 2018
By
Automated Dashboard with Visualization and Regression for Healthcare Data

CategoriesProgramming Tags Data Visualisation Linear Regression R Programming Tips & Tricks In this article, you learn how to make a Automated Dashboard with Visualization and Regression for Healthcare Data in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. I use the healthcare insurance data from...

Read more »

Day 03 – little helper multiplot

December 3, 2018
By
Day 03 – little helper multiplot

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

An Utility Function For Monotonic Binning

December 2, 2018
By

In all monotonic algorithms that I posted before, I heavily relied on the smbinning::smbinning.custom() function contributed by Herman Jopia as the utility function generating the binning output and therefore feel deeply indebted to his excellent work. However, the availability of smbinning::smbinning.custom() function shouldn’t become my excuse for being lazy. Over the weekend, I drafted a

Read more »

Linear Classification Models – Hepatic Dataset

December 2, 2018
By
Linear Classification Models – Hepatic Dataset

This post is following exercise 1 in Chapter 12 of Applied Predicative Modeling. Here I use the machine learning package CARET in R to make classification models; in particular, the linear classification models discussed in Chapter 12. The dataset in question is about hepatic injury (liver damage). It includes a dataframe of biological related predictors of liver damage bio, a...

Read more »

Introduction to chartbookR

Introduction to chartbookR

”“Data scientists … spend from 50 percent to 80 percent of their time … collecting and preparing unruly digital data, before it can be explored for useful nuggets.” (— New York Times). The chartbookR package allows the convenient creation of economic and financial data chartbooks. It handles most of the data wrangling and can thus create large chartbooks with few...

Read more »

Statistics in Glaucoma: Part I

December 2, 2018
By
Statistics in Glaucoma: Part I

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Introduction Glaucoma is a leading cause of blindness worldwide, with a prevalence of 4% in the population aged 40-80. The disease is characterized by retinal ganglion cell death and...

Read more »

One Recipe Step to Rule Them All

In this post I will demonstrate, how my new R package customsteps can be used to create recipe steps, that apply custom transformations to a data set. Note, you should already be fairly familiar with the recipes package before you continue reading this post or give customsteps a spin! Recommended music for this reading session: Introducing the customsteps package Along with the recipes...

Read more »

Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

December 2, 2018
By
Compare population age structures of Europe NUTS-3 regions and the US counties using ternary color-coding

On 28 November 2018 I presented a poster at Dutch Demography Day in Utrecht. Here it is:

Read more »

restez: Query GenBank locally

restez: Query GenBank locally

What is restez? R packages for interacting with the National Center for Biotechnology Information (NCBI) have, to-date, depended on API query calls via NCBI’s Entrez. For computational analyses that require the automated look-up of reams of biological sequence data, piecemeal querying via bandwith-limited requests is evidently not ideal. These queries are not only slow, but they depend on network connections and...

Read more »

Install and Load Multiple R Packages

December 2, 2018
By

In enterprise environment, we generally need to automate the process of installing multiple R packages so that user does not have to install them separately before submitting your program. The function below performs the following operations - First it finds all the already installed R packages Check packages which we want to install are already installed or not. If package is already installed,...

Read more »

2018-12 MetaPost Three Ways

December 2, 2018
By

This report describes three different approaches to communicating between R and MetaPost: importing the PostScript output from MetaPost with the ‘grImport’ package; calling the mpost ...

Read more »

Why R for data science – and not Python?

December 2, 2018
By

There are literally hundreds of programming languages out there, e.g. the whole alphabet of one letter programming languages is taken. In the area of data science there are two big contenders: R and Python. Now why is this blog about R and not Python? I have to make a confession: I really wanted to like … Continue reading "Why...

Read more »

December Reading for Econometricians

December 2, 2018
By

My suggestions for papers to read during December:Askanazi, R., F. X. Diebold, F. Schorfheide, & M. Shin, 2018. On the comparison of interval forecasts. PIER Working Paper 18-013, Penn. Institute for Economic Research, University of Pennsylvania.Me...

Read more »

How to get the homology of a antibody using R

December 2, 2018
By
How to get the homology of a antibody using R

What is homology? Proteins are conserved bio molecules present in all organisms. Some proteins are conserved in many similar species, making them homologues of each other, meaning that the sequence is not the same but there is a degree of similarity. This is measured as homology in percentage. Histone H1 homology in mammals (https://en.wikipedia.org/wiki/Sequence_homology) What … Continue reading How...

Read more »

Day 02 – little helper na_omitlist

December 2, 2018
By
Day 02 – little helper na_omitlist

We at STATWORX work a lot with R and we often use the same little helper function within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

R plus Magento 2 REST API revisited: part 3 – more complex samples of use

December 1, 2018
By

This is 3rd part of series about working with Magento 2 REST API in R. If you haven’t read previous posts in this series, I would recommend to do it. This article sample use the functions defined in previous posts. You may find them at R plus Magento 2 REST API… The post R plus Magento 2 REST API revisited:...

Read more »

TSstudio 0.1.3

December 1, 2018
By

I used the Thanksgiving break to push a new update of the TSstudio package to CRAN (version 0.1.3). The new version includes an update for the ts_backtesting function along with two new function - ts_to_prophet for converting time series objects to a prophet input format (i.e., ds and y columns), and ccf_plot for lags plot between two time series....

Read more »

Solving #AdventOfCode day 1 and 2 with R

December 1, 2018
By

Solving the first four puzzles of Advent of Code with R.

Read more »

What hyper-parameters are, and what to do with them; an illustration with ridge regression

What hyper-parameters are, and what to do with them; an illustration with ridge regression

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 7, which deals with statistical models. In the text below, I explain what hyper-parameters are, and as an example I run a ridge regression using the {glmnet} package. The book is still being written, so comments are...

Read more »

NYC buses: C5.0 classification with R; more than 20 minute delay?

December 1, 2018
By
NYC buses: C5.0 classification with R; more than 20 minute delay?

CategoriesAdvanced Modeling Tags Data Management Data Visualisation R Programming We are continuing on with our NYC bus breakdown problem. When we left off, we had constructed a rule-based Cubist regression model with our expanded pool of predictors; but we were still only managing to explain 37% of the data's variance with our model. Given how 'dirty' the target variable 'time_delayed' is (because it is...

Read more »

Using R: the best thing I’ve changed about my code in years

December 1, 2018
By

Hopefully, one’s coding habits are constantly improving. If you feel any doubt about yourself, I suggest looking back at something you wrote 2011. One thing I’ve changed recently that made my life so much better is a simple silly thing: meaningful name for index and counter variables. Take a look at these pieces of fake

Read more »

Day 01 – little helper checkdir

December 1, 2018
By
Day 01 – little helper checkdir

We at STATWORX work a lot with R and we often use the same little helper function within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

Read more »

Til I Die: Seeking new music

November 30, 2018
By

I’ve been following the tweets from an account called Albums You Must Hear @Albums2Hear. Each tweet is an album recommended by the account owner. I’m a sucker for lists of Albums That I Must Hear Before I Die since I’m always interested in new (or not so new) music recommendations. I wanted to assemble a

Read more »

Simulating dinosaur populations, with R

November 30, 2018
By

So it turns out that the 1990 Michael Crichton novel Jurassic Park is, indeed, a work of fiction. (Personal note: despite the snark to follow, the book is one of my all-time favorites — I clearly remember devouring it in 24 hours straight while ill in a hostel in France.) If the monsters and melodrama didn't give it away,...

Read more »

Search R-bloggers

Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



Quantide: statistical consulting and training

ODSC boston

http://www.eoda.de











Six Sigma Online Training

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.