## Day 07 – little helper count_na

December 7, 2018
By

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

## Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects

December 6, 2018
By
$Bayesian Nonparametric Models in NIMBLE, Part 2: Nonparametric Random Effects$

Bayesian nonparametrics in NIMBLE: Nonparametric random effects Overview NIMBLE is a hierarchical modeling package that uses nearly the same language for model specification as the popular MCMC packages WinBUGS, OpenBUGS and JAGS, while making the modeling language extensible — you can add distributions and functions — and also allowing customization of the algorithms used to

## Rrrrs in R – Letter frequency in R package names

December 6, 2018
By

R package authors sometimes like to add the letter “r” to package names (for example, the tidyverse packages). baRcodeR also has an extra “r” at the end as well. I thought I could use some available data see if the letter frequency changes compared to the English language average. I used two data sets. The first is the percentage frequency...

## Statistics in Glaucoma: Part II

December 6, 2018
By

Samuel Berchuck is a Postdoctoral Associate in Duke University’s Department of Statistical Science and Forge-Duke’s Center for Actionable Health Data Science. Joshua L. Warren is an Assistant Professor of Biostatistics at Yale University. Analyzing Visual Field Data In Part I of this series on statistic in glaucoma, we detailed the use of visual fields for understanding functional vision loss in glaucoma patients....

## Network Centrality in R: An Introduction

December 6, 2018
By

This is the first post of a series on the concept of “network centrality” with applications in R and the package netrankr. There is already a rudimentary tutorial for the package, but I wanted to extend it to a broader tutorial for network centrality. The main focus of the blog series will be the applications in R and conceptual considerations will only...

## Intuition for principal component analysis (PCA)

December 6, 2018
By

Principal component analysis (PCA) is a dimension-reduction method that can be used to reduce a large set of (often correlated) variables into a smaller set of (uncorrelated) variables, called principal components, which still contain most of the information. PCA is a concept that is traditionally hard to grasp so instead of giving you the n’th … Continue reading "Intuition...

## Automated Dashboard visualizations with Deviation in R

December 6, 2018
By

CategoriesProgramming Tags Data Visualisation R Markdown R Programming Tips & Tricks In this article, you learn how to make Automated Dashboard visualizations with Deviation in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. After this you type the following code in order to create a...

## Day 06 – little helper statusbar

December 6, 2018
By

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

## Running an R script on heroku

December 5, 2018
By

In this post I will show you how to run an R script on heroku every day. This is a continuation of my previous post on tweeting a death from wikidata. Why would I want to run a script on heroku? It is extremely simple, you don’t need to spin up a machine in the cloud on AWS, Google, Azure or...

## The JapanR Conference 2018 Round-Up!

December 5, 2018
By

This past weekend was the 9th JapanR Conference hosted at LINE Corporation in Tokyo, Japan! I’ve been back in Japan for nearly a year now and I’ve been going to nearly every one of the R user meetups here, TokyoR, and it’s ...

## Gender Diversity in the R and Python Communities

December 5, 2018
By

Many (if not most) tech communities have far more representation from men than from women (and even fewer from nonbinary folk). This is a shame, because everybody uses software, and these projects would self-evidently benefit from the talent and expertise from across the entire community. Some projects are doing better than others, though, and data scientist Reshama Shaikh recently...

## Extract data from a PNG/TIFF

December 5, 2018
By

Sometimes it’s useful to be able to extract data from a published figure. If the figure isn’t a vector based format (for which the numeric data is probably still in the file), it’s possible to digitize the image with R, click the points and extract it that way. The digitize package is simple to use

## Creating Tables Using R and Pure HTML

December 5, 2018
By

A problem with R is that its tables are not good enough to share with non-R users, both in terms of visual attractiveness and ease...

## Automated Dashboard with various correlation visualizations in R

December 5, 2018
By

CategoriesProgramming Tags Correlation Data Visualisation R Programming In this article, you learn how to make Automated Dashboard with various correlation visualizations in R. First you need to install the `rmarkdown` package into your R library. Assuming that you installed the `rmarkdown`, next you create a new `rmarkdown` script in R. After this you type the following code in order to create a Related Post Automated...

## Day 05 – little helper get_network

December 5, 2018
By

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

## ggQC | ggplot Quality Control Charts – New Release

December 4, 2018
By

The ggQC package is a quality control extension for ggplot. Use it to create XmR, XbarR, C and many other highly customizable Control Charts. Additional statistical process control functions include Shewart violation checks as well as capability analysis. If your process is running smoothly, visualize the potential impacted of your next process improvement with a Pareto chart. To learn...

## Trust in ML models. Slides from TWiML & AI EMEA Meetup + iX Articles

December 4, 2018
By

Here you find my slides from last nights TWiML & AI EMEA Meetup about Trust in ML models, where I presented the Anchors paper by Carlos Guestrin et al.. I have also just written two articles for the German IT magazin iX about the same topic of...

## Community Call – Governance strategies for open source research software projects

🎤 Dan Sholler, rOpenSci Postdoctoral Fellow 🕘 Tuesday, December 18, 2018, 10-11AM PST; 7-8PM CET (find your timezone) ☎️ Details for joining the Community Call. Everyone is welcome. No RSVP needed. Researchers use open source software for the capabilities it provides, such as streamlined data access and analysis and interoperability with other pieces of the scientific computing ecosystem. For most complex software,...

## Heatmaps of Mortality Rates

December 4, 2018
By

As part of the run-up to the release of Data Visualization (out in about ten days! Currently 30% off on Amazon!), I’ve been playing with graphing different kinds of data. One great source of rich time-series data is mortality.org, which hosts a collection of standardized demographic data for a large number of countries. Mortality rates are often interesting to...

## Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation

December 4, 2018
By
$Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation$

Bayesian Nonparametric Models in NIMBLE, Part 1: Density Estimation Bayesian nonparametrics in NIMBLE: Density estimation Overview NIMBLE is a hierarchical modeling package that uses nearly the same language for model specification as the popular MCMC packages WinBUGS, OpenBUGS and JAGS, while making the modeling language extensible — you can add distributions and functions — and

## Deep learning in Satellite imagery

December 4, 2018
By

In this article, I hope to inspire you to start exploring satellite imagery datasets. Recently, this technology has gained huge momentum, and we are finding that new possibilities arise when we use satellite image analysis. Satellite data changes the game because it allows us to gather new information that is not readily available to businesses. Artykuł Deep learning in...

## Starspace for NLP #nlproc

December 4, 2018
By

Our recent addition to the NLP R universe is called R package ruimtehol which is open sourced at https://github.com/bnosac/ruimtehol This R package is a wrapper around Starspace which provides a neural embedding model for doing the following on text: Text classification Learning word, sentence or document level embeddings Finding sentence or document similarity Ranking web documents Content-based recommendation (e.g. recommend text/music based on the...

## Day 04 – little helper evenstrings

December 4, 2018
By

We at STATWORX work a lot with R and we often use the same little helper functions within our projects. These functions ease our daily work life by reducing repetitive code parts or by creating overviews of our projects. At first, there was no plan to make a package, but soon I realised, that it will be much easier...

## Solving #AdventOfCode day 5 and 6 with R

December 3, 2018
By

Solving the puzzles of Advent of Code with R.

## Solving #AdventOfCode day 3 and 4 with R

December 3, 2018
By

Solving the puzzles of Advent of Code with R.

## My 2018 #rstats year in review

December 3, 2018
By

This past year in R has been a good one for me; productive, exciting, different. I decided it was worth taking a moment to reflect and share. Goals were set and met. There were also unexpected changes. CRAN A year ago, at the tail end of 2017, I pu...

## rnoaa: new data sources and NCDC units

We’ve just released a new version of rnoaa with A LOT of changes. Check out the release notes for a complete list of changes. We’ll highlight a few things in this post: New data sources in the package NCDC units added to the output of ncdc() Links: rnoaa source code: https://github.com/ropensci/rnoaa rnoaa on CRAN: https://cran.rstudio.com/web/packages/rnoaa/ Installation Install the lastest from CRAN install.packages("rnoaa") Some binaries are not up yet on CRAN -...

## Detecting spatiotemporal groups in relocation data with spatsoc

spatsoc is an R package written by Alec Robitaille, Quinn Webber and Eric Vander Wal of the Wildlife Evolutionary Ecology Lab (WEEL) at Memorial University of Newfoundland. It is the lab’s first R package and was recently accepted through the rOpenSci onboarding process with a big thanks to reviewers Priscilla Minotti and Filipe Teixeira, and editor Lincoln Mullen. spatsoc started...

## GARCH and a rudimentary application to Vol Trading

December 3, 2018
By

This post will review Kris Boudt’s datacamp course, along with introducing some concepts from it, discuss GARCH, present an application … Continue reading →