A duck. Giving a look at DuckDB since MonetDBLite was removed from CRAN

October 26, 2019
By
A duck. Giving a look at DuckDB since MonetDBLite was removed from CRAN

You may know that MonetDBLite was removed from CRAN.DuckDB comming up. Breaking change __ install.packages('MonetDBLite') Warning in install.packages : package ‘MonetDBLite’ is not available (for R version 3.6.1)People who based their works on MonetDBLite may ask what happened, what to do. Not to play a risky game with database and tools choices for future works… (“It’s really fast but we may...

Read more »

RAthena 1.3.0 has arrived

October 25, 2019
By

RBloggers|RBloggers-feedburner Recap: RAthena is a R package that interfaces into Amazon Athena. However, it doesn’t use the standard ODBC and JDBC drivers like AWR.Athena and metis. Instead RAthena utilises Python’s SDK (software development kit) into Amazon, Boto3. It does this by using the reticulate package that provides an interface into Python. What this means is that RAthena doesn’t require any driver...

Read more »

A Comprehensive Introduction to Command Line for R Users

October 25, 2019
By
A Comprehensive Introduction to Command Line for R Users

In this tutorial, you will be introduced to the command line. We have selected a set of commands we think will be useful in general to a wide range of audience. We have created a RStudio Cloud Project to ensure that all readers are using the same environment while going through the tutorial. Our goal was to ensure that after...

Read more »

Pivoting tidily

October 25, 2019
By
Pivoting tidily

One of the fun bits of my job is that I have actual time dedicated to helping colleagues and grad students with statistical or computational problems. Recently I’ve been helping one of our Lab Instructors with some data that from their Plant Physiology Lab course. Whilst I was writing some R code to import the raw data for the...

Read more »

dang 0.0.11: Small improvements

October 24, 2019
By

A new release of what may be my most minor package, dang, is now on CRAN. The dang package regroups a few functions of mine that had no other home as for example lsos() from a StackOverflow question from 2009 (!!) is one, this overbought/oversold pri...

Read more »

The politics of New Mexico: a brief historical-visual account

October 23, 2019
By
The politics of New Mexico: a brief historical-visual account

New Mexico demographics via tidycensus 2016 Presidential Election Presidential elections in New Mexico historically New Mexico as bellwether? Congressional delegation historically New Mexico State Government Summary In this post, we piece together a brief political history of New Mexico using a host of data sources, including Wikipedia, the US Census, the New Mexico State Legislature (NMSL), VoteView, and the National Conference of State Legislatures. A bit...

Read more »

RStudio Professional Drivers 1.6.0

October 23, 2019
By
RStudio Professional Drivers 1.6.0

Access to data is crucial for data science. Unfortunately, servers that run RStudio are often disconnected from databases, especially in organizations that are new to R. In order to help data scientists access their databases, RStudio offers ODBC data connectors that are supported, easy to install, and designed to work everywhere you use RStudio professional products. The 1.6.0 release...

Read more »

Gold-Mining Week 8 (2019)

October 23, 2019
By

Week 8 Gold Mining and Fantasy Football Projection Roundup now available -on time and ready to go! The post Gold-Mining Week 8 (2019) appeared first on Fantasy Football Analytics.

Read more »

Decision Making Support Systems #3: Differences between IA and AI

October 23, 2019
By
Decision Making Support Systems #3: Differences between IA and AI

The Differences between Artificial Intelligence and Augmented Intelligence In previous posts, we looked at the definition of Artificial Intelligence (AI) and the definition of Intelligence Augmentation (IA). So, what are the differences between the two? Intelligence Augmentation has always been concerned with aiding human decision making and keeping humans-in-the-loop, whereas the AI endeavor seeks to Article Decision Making Support...

Read more »

Horizontal scaling of data science applications in the cloud

October 23, 2019
By
Horizontal scaling of data science applications in the cloud

Prediction models, machine learning algorithms and scripts for data storage: The modern data science application not only shows more and more complexity, but also puts the existing infrastructure to the test by temporary resource peaks. In this article, we will show how tools such as the RStudio Job Launcher in conjunction with a Kubernetes cluster

Read more »

linl 0.0.4: Now with footer

October 23, 2019
By
linl 0.0.4: Now with footer

A new release of our linl package for writing LaTeX letters with (R)markdown just arrived on CRAN. linl makes it easy to write letters in markdown, with some extra bells and whistles thanks to some cleverness chiefly by Aaron. This version now suppor...

Read more »

pkgKitten 0.1.5: Creating R Packages that purr

October 22, 2019
By
pkgKitten 0.1.5: Creating R Packages that purr

Another minor release 0.1.5 of pkgKitten just hit on CRAN today, after a break of almost three years. This release provides a few small changes. The default per-package manual page now benefits from a second refinement (building on what was introduc...

Read more »

Council spending – open data

October 22, 2019
By
Council spending – open data

My local authority recently decided to publish all spending over £500 in an effort to be more transparent. Here’s a post taking an overview of what they’ve published. I’ve used R for the… Continue reading →

Read more »

Super Solutions for Shiny Architecture #5 of 5: Automated Tests

October 22, 2019
By
Super Solutions for Shiny Architecture #5 of 5:  Automated Tests

TL;DR Describes the best practices for setting automated test architecture for Shiny apps. Automate and test early and often with unit tests, user interface tests, and performance tests. Best Practices for Testing Your Shiny App Even your best apps will break down at some point during development or during User Acceptance Tests. I can bet Article Super Solutions for...

Read more »

Understanding Blockchain Technology by building one in R

October 22, 2019
By
Understanding Blockchain Technology by building one in R

By now you will know that it is a good tradition of this blog to explain stuff by rebuilding toy examples of it in R (see e.g. Understanding the Maths of Computed Tomography (CT) scans, So, what is AI really? or Google’s Eigenvector… or how a Random Surfer finds the most relevant Webpages). This time … Continue reading "Understanding...

Read more »

digest 0.6.22: More goodies!

October 21, 2019
By

A new version of digest arrived at CRAN earlier today, and I just sent an updated package to Debian too. digest creates hash digests of arbitrary R objects (using the md5, sha-1, sha-256, sha-512, crc32, xxhash32, xxhash64, murmur32, and spookyhash a...

Read more »

Ordering Sentinel-2 products from Long Term Archive with sen2r

October 21, 2019
By
Ordering Sentinel-2 products from Long Term Archive with sen2r

Until August 2019, all Sentinel-2 satellite data could be directly downloaded from the ESA Data Hub, both through the interactive Open Hub or using an API interface. Recently this policy was changed: typically, only the most recent products are available for direct download, while the oldest ones (level 2A archives older than 18 months and level 1C older than one year) are stored in the so...

Read more »

Access the free economic database DBnomics with R

October 21, 2019
By
Access the free economic database DBnomics with R

DBnomics : the world’s economic database Explore all the economic data from different providers (national and international statistical institutes, central banks, etc.), for free, following the link db.nomics.world. You can also retrieve all the economic data through the rdbnomics package here. This blog post describes the different ways to do so. Fetch time series by ids First, let’s assume that we know which series we...

Read more »

Trends in U.S. Border Crossing Entry since 1996

October 21, 2019
By
Trends in U.S. Border Crossing Entry since 1996

Introduction Since the 2016 election, inland U.S. Border security has been the huge topic. The construction for the new border wall has started and the tension between Mexico and U.S. has intensified along with it. Many people predicted not only the decrease in number of illegal border entry but also the decrease in number of

Read more »

Split Intermixed Names into First, Middle, and Last

October 21, 2019
By

Data cleaning can be a challenge, so I hope this helps the process for someone out there. This is a tiny, but valuable function for…Continue ReadingSplit Intermixed Names into First, Middle, and Last

Read more »

Gold-Mining Week 7 (2019)

October 21, 2019
By

Week 7 Gold Mining and Fantasy Football Projection Roundup now available. The post Gold-Mining Week 7 (2019) appeared first on Fantasy Football Analytics.

Read more »

Widening Multiple Columns Redux

October 21, 2019
By

Last year I wrote about the slightly tedious business of spreading (or widening) multiple value columns in Tidyverse-flavored R. Recent updates to the tidyr package, particularly the introduction of the pivot_wider() and pivot_longer() functions, have made this rather more straightforward to do than before. Here I recapitulate the earlier example with the new tools. The motivating case is something that...

Read more »

Avoiding embarrassment by testing data assumptions with expectdata

October 21, 2019
By
Avoiding embarrassment by testing data assumptions with expectdata

Expectdata is an R package that makes it easy to test assumptions about a data frame before conducting analyses. Below is a concise tour of some of the data assumptions expectdata can test for you. For example, Note: assertr is an ropensci project t...

Read more »

(Much) faster unnesting with data.table

Today I was struggling with a relatively simple operation: unnest() from the tidyr package. What it’s supposed to do is pretty simple. When you have a data.frame where one or multiple columns are lists, you can unlist these columns while duplicating the information in other columns if the length of an element is larger than 1. library(tibble) df

Read more »

IPO Exploration

October 20, 2019
By
IPO Exploration

Inspired by recent headlines like “Fear Overtakes Greed in IPO Market after WeWork Debacle” and “This Year’s IPO Class is Least Profitable since the Tech Bubble”, today we’ll explore historical IPO data, and next time we’ll look at the the performance of IPO-driven portfolios constructed during the ten-year period from 2004 to 2014. I’ll admit, I’ve often wondered how...

Read more »

Bootstrapping time series for improving forecasting accuracy

October 20, 2019
By
Bootstrapping time series for improving forecasting accuracy

Bootstrapping time series? It is meant in a way that we generate multiple new training data for statistical forecasting methods like ARIMA or triple exponential smoothing (Holt-Winters method etc.) to improve forecasting accuracy. It is called bootstra...

Read more »

rmangal: making ecological networks easily accessible

rmangal: making ecological networks easily accessible

In early September, the version 2.0.0 of rmangal was approved by rOpenSci, four weeks later it made it to CRAN. Following-up on our experience we detail below the reasons why we wrote rmangal, why we submitted our package to rOpenSci and how the peer review improved our package. Mangal, a database for ecological networks Ecological networks are defined as a set of species populations...

Read more »

RcppGSL 0.3.7: Fixes and updates

October 20, 2019
By

A new release 0.3.7 of RcppGSL is now on CRAN. The RcppGSL package provides an interface from R to the GNU GSL using the Rcpp package. Stephen Wade noticed that we were not actually freeing memory from the GSL vectors and matrices as we set out to do...

Read more »

Permutation Feature Importance (PFI) of GRNN

October 19, 2019
By
Permutation Feature Importance (PFI) of GRNN

In the post https://statcompute.wordpress.com/2019/10/13/assess-variable-importance-in-grnn, it was shown how to assess the variable importance of a GRNN by the decrease in GoF statistics, e.g. AUC, after averaging or dropping the variable of interest. The permutation feature importance evaluates the variable importance in a similar manner by permuting values of the variable, which attempts to break the

Read more »

Search R-bloggers

Sponsors