digest 0.6.22: More goodies!

October 21, 2019
By

A new version of digest arrived at CRAN earlier today, and I just sent an updated package to Debian too. digest creates hash digests of arbitrary R objects (using the md5, sha-1, sha-256, sha-512, crc32, xxhash32, xxhash64, murmur32, and spookyhash a...

Read more »

Trends in U.S. Border Crossing Entry since 1996

October 21, 2019
By
Trends in U.S. Border Crossing Entry since 1996

Introduction Since the 2016 election, inland U.S. Border security has been the huge topic. The construction for the new border wall has started and the tension between Mexico and...

Read more »

Split Intermixed Names into First, Middle, and Last

October 21, 2019
By

Data cleaning can be a challenge, so I hope this helps the process for someone out there. This is a tiny, but valuable function for…Continue ReadingSplit Intermixed Names into...

Read more »

Gold-Mining Week 7 (2019)

October 21, 2019
By

Week 7 Gold Mining and Fantasy Football Projection Roundup now available. The post Gold-Mining Week 7 (2019) appeared first on Fantasy Football Analytics.

Read more »

Widening Multiple Columns Redux

October 21, 2019
By

Last year I wrote about the slightly tedious business of spreading (or widening) multiple value columns in Tidyverse-flavored R. Recent updates to the tidyr package, particularly the introduction of...

Read more »

Avoiding embarrassment by testing data assumptions with expectdata

October 21, 2019
By
Avoiding embarrassment by testing data assumptions with expectdata

Expectdata is an R package that makes it easy to test assumptions about a data frame before conducting analyses. Below is a concise tour of some of the data...

Read more »

Bootstrapping time series for improving forecasting accuracy

October 20, 2019
By
Bootstrapping time series for improving forecasting accuracy

Bootstrapping time series? It is meant in a way that we generate multiple new training data for statistical forecasting methods like ARIMA or triple exponential smoothing (Holt-Winters method etc.)...

Read more »

rmangal: making ecological networks easily accessible

rmangal: making ecological networks easily accessible

In early September, the version 2.0.0 of rmangal was approved by rOpenSci, four weeks later it made it to CRAN. Following-up on our experience we detail below the reasons why we...

Read more »

RcppGSL 0.3.7: Fixes and updates

October 20, 2019
By

A new release 0.3.7 of RcppGSL is now on CRAN. The RcppGSL package provides an interface from R to the GNU GSL using the Rcpp package. Stephen Wade noticed...

Read more »

Permutation Feature Importance (PFI) of GRNN

October 19, 2019
By
Permutation Feature Importance (PFI) of GRNN

In the post https://statcompute.wordpress.com/2019/10/13/assess-variable-importance-in-grnn, it was shown how to assess the variable importance of a GRNN by the decrease in GoF statistics, e.g. AUC, after averaging or dropping the...

Read more »

Building a Corporate R Package for Pleasure and Profit

October 19, 2019
By
Building a Corporate R Package for Pleasure and Profit

The “Great Restructuring” of our economy is underway. That’s the official name for what we know is happening: the best are rising to the top, and the mediocre are...

Read more »

Partial Dependence Plot (PDP) of GRNN

October 19, 2019
By
Partial Dependence Plot (PDP) of GRNN

The function grnn.margin() (https://github.com/statcompute/yager/blob/master/code/grnn.margin.R) was my first attempt to explore the relationship between each predictor and the response in a General Regression Neural Network, which usually is considered the...

Read more »

SQL Server Schemas & R Tip

October 18, 2019
By
SQL Server Schemas & R Tip

I ran into an issue the other day where I was tring to write a new table to a SQL Server Database with a non-default schema. I did end up...

Read more »

How confident are you? Assessing the uncertainty in forecasting

October 18, 2019
By
How confident are you? Assessing the uncertainty in forecasting

Introduction Some people think that the main idea of forecasting is in predicting the future as accurately as possible. I have bad news for them. The main idea of...

Read more »

Vignette: Google Trends with the gtrendsR package

October 17, 2019
By
Vignette: Google Trends with the gtrendsR package

Background Google Trends is a well-known, free tool provided by Google that allows you to analyse the popularity of top search queries on its Google search engine. In market...

Read more »

three birthdays and a numeral

October 17, 2019
By
three birthdays and a numeral

The riddle of the week on The Riddler was to find the size n of an audience for at least a 50% chance of observing at least one triplet...

Read more »

Practical Data Science with R 2nd Edition update

October 17, 2019
By

We are in the last stages of proofing the galleys/typesetting of Zumel, Mount, Practical Data Science with R, 2nd Edition, Manning 2019. So this edition will definitely be out...

Read more »

Job: Junior Systems Administrator (with a focus on R/Python)

October 17, 2019
By

Jumping Rivers is a data science consultancy company focused on R and Python. We work across industries and throughout the world. We offer a mixture of training, modelling, and...

Read more »

rBokeh – Don’t be stopped by missing arguments!

October 17, 2019
By
rBokeh – Don’t be stopped by missing arguments!

rBokeh is an interactive plotting library. Since it functions lack some arguments compared to its Python counterpart, plots are sometimes difficult to customize. I will show how to overcome...

Read more »

Repetitive Q: Reading Multiple Files in the Zip Folder

October 17, 2019
By

Dear Readers,I always see a repetitive question coming to me and across various forums on how to read multiple files in the zip folder of same separator or multiple...

Read more »

2 Months in 2 Minutes – rOpenSci News, October 2019

2 Months in 2 Minutes – rOpenSci News, October 2019

rOpenSci HQ What would you like to hear about in an rOpenSci Community Call? We are soliciting your “votes” and new ideas for Community Call topics and speakers. Find out...

Read more »

Productionizing Shiny and Plumber with Pins

October 16, 2019
By
Productionizing Shiny and Plumber with Pins

Producing an API that serves model results or a Shiny app that displays the results of an analysis requires a collection of intermediate datasets and model objects, all of...

Read more »

Map coloring: the color scale styles available in the tmap package

Map coloring: the color scale styles available in the tmap package

This vignette builds on the making maps chapter of the Geocomputation with R book. Its goal is to demonstrate all possible map styles available in the tmap package. Prerequisites The examples below...

Read more »

Non-Gaussian forecasting using fable

October 16, 2019
By
Non-Gaussian forecasting using fable

library(tidyverse) library(tsibble) library(lubridate) library(feasts) library(fable) In my previous post about the new fable package, we saw how fable can produce forecast distributions, not just point forecasts. All my examples used Gaussian (normal) distributions, so...

Read more »

Automatic DAG learning – part 1

October 16, 2019
By
Automatic DAG learning – part 1

I was really struggling with finding a header pic for this post when I came across the one above - titled “Dag scoring and selection” and since it’s sort...

Read more »

Advancing Text Mining with R and quanteda

October 16, 2019
By
Advancing Text Mining with R and quanteda

Everyone is talking about text analysis. Is it puzzling that this data source is so popular right now? Actually no. Most of our datasets rely on (hand-coded) textual information....

Read more »

Using bwimge R package to describe patterns in images of natural structures

October 15, 2019
By
Using bwimge R package to describe patterns in images of natural structures

This tutorial illustrates how to use the bwimge R package (Biagolini-Jr 2019) to describe patterns in images of natural structures. Digital images are basically two-dimensional objects composed by cells...

Read more »

Private Security and the Pareto Principle

October 15, 2019
By
Private Security and the Pareto Principle

Private Security is a big industry in South Africa. Most Private Security companies promise to provide a rapid response to every callout generated by any of their customers. There...

Read more »

New package: GetEdgarData

October 15, 2019
By
New package: GetEdgarData

Introduction Every company traded in the US stock market must report its quarterly and yearly documents to the SEC and the public in general. This includes its accounting statements (10-K,...

Read more »

Search R-bloggers

Sponsors