RcppEigen 0.3.3.7.0

November 16, 2019
By

A new minor release 0.3.3.7.0 of RcppEigen arrived on CRAN today (and just went to Debian too) bringing support for Eigen 3.3.7 to R. This release comes almost a year after the previous minor release 0.3.3.5.0. Besides the upgrade to the new upstream...

Read more »

Practical Data Science with R, 2nd Edition, IS OUT!!!!!!!

November 15, 2019
By
Practical Data Science with R, 2nd Edition, IS OUT!!!!!!!

Practical Data Science with R, 2nd Edition author Dr. Nina Zumel, with a fresh author’s copy of her book!

Read more »

The hidden diagnostic plots for the lm object

November 14, 2019
By
The hidden diagnostic plots for the lm object

When plotting an lm object in R, one typically sees a 2 by 2 panel of diagnostic plots, much like the one below: This link has an excellent explanation...

Read more »

Gold-Mining Week 11 (2019)

November 14, 2019
By

Week 11 Gold Mining and Fantasy Football Projection Roundup now available. The post Gold-Mining Week 11 (2019) appeared first on Fantasy Football Analytics.

Read more »

IPO Exploration Part Two

November 13, 2019
By

In a previous post, we explored IPOs and IPO returns by sector and year since 2004. Today, let’s investigate how portfolios formed with those IPOs have performed. We will...

Read more »

workloopR: Analysis of work loops and other data from muscle physiology experiments in R

workloopR: Analysis of work loops and other data from muscle physiology experiments in R

Studies of muscle physiology often rely on closed-source, proprietary software for not only recording data but also for data wrangling and analyses. Although specialized software might be necessary to...

Read more »

Machine Learning in R: Start with an End-to-End Test

November 13, 2019
By
Machine Learning in R: Start with an End-to-End Test

As a data scientist, you will likely be asked one day to automate your analysis and port your models to production environments. When that happens you cross the blurry...

Read more »

Durban EDGE DataQuest

November 12, 2019
By
Durban EDGE DataQuest

The Durban EDGE (Economic Development and Growth in eThekwini) DataQuest was held at UKZN (Westville Campus) on 13 November 2019. Participants were tasked with creating something interesting and useful...

Read more »

The Colour of Everything

November 12, 2019
By
The Colour of Everything

I’m happy to announce that farver 2.0 has landed on CRAN. This is a big release comprising of a rewrite of much of the internals along with a range of...

Read more »

Automating update of a fiscal database for the Euro Area

November 12, 2019
By
Automating update of a fiscal database for the Euro Area

Our purpose is to write a program to automatically update a quarterly fiscal database for the Euro Area. The main difficulty of this exercise is to build long series...

Read more »

When Cross-Validation is More Powerful than Regularization

November 12, 2019
By
When Cross-Validation is More Powerful than Regularization

Regularization is a way of avoiding overfit by restricting the magnitude of model coefficients (or in deep learning, node weights). A simple example of regularization is the use of...

Read more »

Logistic Regression in R: A Classification Technique to Predict Credit Card Default

November 12, 2019
By
Logistic Regression in R: A Classification Technique to Predict Credit Card Default

Logistic Regression is one of the most popular classification techniques. In this sneak peek from Data Science Dojo's bootcamp, you'll learn about this popular algorithm and go through a...

Read more »

AzureR updates: AzureStor, AzureVM, AzureGraph, AzureContainers

November 12, 2019
By

Some major updates to AzureR packages this week! As well as last week's AzureRMR update, there are changes to AzureStor, AzureVM, AzureGraph and AzureContainers. All of these are live...

Read more »

Azure AI and Machine Learning talk series

November 12, 2019
By
Azure AI and Machine Learning talk series

At last week's Microsoft Ignite conference in Orlando, our team delivered a series of 6 talks about AI and machine learning applications with Azure. The videos from each talk...

Read more »

My AP Statistics Class First R Programming Assignment Using RStudio

November 12, 2019
By
My AP Statistics Class First R Programming Assignment Using RStudio

My AP Stats class has started their first R programming assignment this week. I gave them the code for them to type in and play with. This will give...

Read more »

RcppAnnoy 0.0.14

November 12, 2019
By
RcppAnnoy 0.0.14

A new minor release of RcppAnnoy is now on CRAN, following the previous 0.0.13 release in September. RcppAnnoy is the Rcpp-based R integration of the nifty Annoy library by...

Read more »

dplyr and Oracle database with odbc on windows

November 12, 2019
By
dplyr and Oracle database with odbc on windows

RStudio makes Oracle accessibility from R easier via odbc and connections Pane1. Personally, I find it’s not so easy. As it finally works for me, I will detail some snippets...

Read more »

Teach R to see by Borrowing a Brain

November 12, 2019
By
Teach R to see by Borrowing a Brain

It has been an old dream to teach a computer to see, i.e. to hold something in front of a camera and let the computer tell you what it...

Read more »

An API for @racently

November 11, 2019
By
An API for @racently

@racently is a side project that I have been nursing along for a couple of years. It addresses a problem that I have as a runner: my race results...

Read more »

Trying the ckanr Package

November 11, 2019
By
Trying the ckanr Package

How resources are grouped in CKAN Initialising ckanr and exploring groups of resources Connect to CKAN with dplyr and download from one resource Downloading all resources from a dataset In previous blog posts...

Read more »

Community Call – Last Night, Testing Saved my Life

Community Call – Last Night, Testing Saved my Life

To the uninitiated, software testing may seem variously boring, daunting or bogged down in obscure terminology. However, it has the potential to be enormously useful for people developing software...

Read more »

What can we really expect to learn from a pilot study?

November 11, 2019
By
What can we really expect to learn from a pilot study?

I am involved with a very interesting project - the NIA IMPACT Collaboratory - where a primary goal is to fund a large group of pragmatic pilot studies to...

Read more »

Using R and H2O Isolation Forest For Data Quality

November 11, 2019
By
Using R and H2O Isolation Forest For Data Quality

Introduction: We will identify anomalous patterns in data, this process is useful, not only to find inconsistencies and errors but also to find abnormal data behavior, being useful even to...

Read more »

Free Training: Mastering Data Structures in R

November 11, 2019
By

Next week I will be delivering a free online R training. This is a new course I've created called Mastering Data Structures in R. This course is for you...

Read more »

Scraping Machinery Parts

November 10, 2019
By
Scraping Machinery Parts

I’ve been exploring the feasibility of aggregating data on prices of replacement parts for heavy machinery. There are a number of websites which list this sort of data. I’m...

Read more »

Geocoding with Tidygeocoder

November 10, 2019
By
Geocoding with Tidygeocoder

Tidygeocoder is a newly published R package which provides a tidyverse-style interface for geocoding. It returns latitude and longitude coordinates in tibble format from addresses using the US Census or...

Read more »

Statistical uncertainty with R and pdqr

November 10, 2019
By
Statistical uncertainty with R and pdqr

CRAN has accepted my 'pdqr' package. Here are important examples of how it can be used to describe and evaluate statistical uncertainty. ...

Read more »

A comparison of methods for predicting clothing classes using the Fashion MNIST dataset in RStudio and Python (Part 1)

November 10, 2019
By
A comparison of methods for predicting clothing classes using the Fashion MNIST dataset in RStudio and Python (Part 1)

Florianne Verkroost is a PhD candidate at Nuffield College at the University of Oxford. With a passion for data science and a background in mathematics and econometrics. She applies...

Read more »

Cleaning the Table

November 10, 2019
By

While I’m talking about getting data into R this weekend, here’s another quick example that came up in class this week. The mortality data in the previous example were...

Read more »

Search R-bloggers

Sponsors