## Introduction to recommender systems

August 20, 2018

Why build a recommender system? The most wonderful and most frustrating characteristic of the Internet is its excessive supply of content. As a result, many of today’s commercial giants are not content providers, but content distributors. The success of companies such as Amazon, Netflix, YouTube and Spotify relies on their ability to effectively deliver relevant and novel content to...

## Materials for Teaching Applied Statistics

August 20, 2018

Today is the first day of the new academic year at the University of Utah. This semester I am teaching MATH 3070: Applied Statistics I, the fourth time I've...

## BooST series I: Advantage in Smooth Functions

August 20, 2018

By Gabriel Vasconcelos and Yuri Fonseca. Introduction: This is the first of a series of posts on the BooST (Boosting Smooth Trees). If you missed the first post introducing...

## rfoaas 2.0.0: Updated and extended

August 20, 2018

FOAAS upstream recently went to release 2.0.0, so here we are catching up bringing you all the new accessors from FOAAS 2.0.0: bag(), equity(), fts(), ing(), particular(), ridiculous(),...

## Harvesting Data From the Web With Rvest: Exercises

August 20, 2018

The rvest package allows for simple and convenient extraction of data from the web into R, which is often called “web scraping.” Web scraping is a basic and important...

## Reproducible research and a repository of artifacts, a RFC

August 20, 2018

This work is still in progress. I think, however, it can already resonate with some people in the community. The communication I am hopeful for should lead to a...

## Statistics Sunday: Using Text Analysis to Become a Better Writer

August 19, 2018

We all have words we love to use, and that we perhaps use too much. As an example: I have...

## Clustered Covariances in sandwich 2.5-0

August 19, 2018

Version 2.5-0 of the R package 'sandwich' is available from CRAN now with enhanced object-oriented clustered covariances (for lm, glm, survreg,...

## Ordered Probit Model and Price Movements of High-Frequency Trades

August 19, 2018

The analysis of high-frequency stock transactions has played an important role in algorithmic trading, and the results can be used to monitor stock movements and to develop...

## More Practical Data Science with R Book News

August 19, 2018

Some more Practical Data Science with R news. Practical Data Science with R is the book we wish we had when we started in data science. Practical Data Science...

## Mapping the Prevalence of Alzheimer Disease Mortality in the USA

August 18, 2018

In comparison with other statistical software (e.g., SAS, STATA, and SPSS), R is the best for data visualization. Therefore, in all posts I have written for DataScience+ I take...

## R:case4base – code profiling with base R

August 18, 2018

Introduction: In this summertime post in the case4base series, we will look at useful tools in base R, which let us profile our code without any extra packages needed to...

## approximative Laplace

August 17, 2018

I came across this question on X validated that wondered about one of our examples in Monte Carlo Statistical Methods. We have included a section on Laplace approximations in...

## Topics and Categories in the Russian Troll Tweets

August 17, 2018

I decided to return to the analysis I conducted for the IRA tweets dataset. (You can read up on...

## A Review of James Picerno’s Quantitative Investment Portfolio Analytics in R

August 17, 2018

This is a review of James Picerno’s Quantitative Investment Portfolio Analytics in R. Overall, it’s about as fantastic a book …

## How to Create Sankey Diagrams From Tables (Data Frames) Using R

August 17, 2018

Step 1: Create a tidy data frame. The very first step in creating visualizations is to get the data in a useful format. In the...

August 17, 2018

A new RcppArmadillo release 0.9.100.5.0, based on the new Armadillo release 9.100.5 from earlier today, is now on CRAN and in Debian. It once again follows our (and Conrad's)...

## An R package to create NEWS.md files

August 17, 2018

A while back, I started to create an R package that would help me and my colleagues at STATWORX with our daily work. After writing the DESCRIPTION file, I...

## Many reports from 1 RMarkdown file

August 17, 2018

I was at the EdinbR talk this week by the RStudio community lead – Curtis Kephart. It was really interesting, but I disagree with his suggestion to point and...

## Make R speak

August 16, 2018

Ever wanted to make R talk to you? Now you can, with the mscstts package by John Muschelli. It provides an interface to the Microsoft Cognitive Services Text-to-Speech API...

## Relative risk ratios and odds ratios by @ellis2013nz

August 16, 2018

This post tries to explain the difference between odds ratios and relative risk ratios; and how just a few letters in the code fitting a generalized linear model mean...

## Great post!

August 16, 2018

Great post! I wanted to mention that although many previous studies have used the area under the receiver operating characteristic curve (auROC) statistic to benchmark the precision, it misleads evaluators when...

## Updates to the sergeant (Apache Drill connector) Package & a look at Apache Drill 1.14.0 release

August 16, 2018

Apache Drill 1.14.0 was recently released, bringing with it many new features and a temporary incompatibility with the current rev of the MapR ODBC drivers. The Drill community expects...

## Bio7 2.9 Released

August 16, 2018

16.08.2018 A new release of Bio7 is available. The new Bio7 2.9 release comes with a plethora of new R features and bugfixes. Release Notes: General: Based on Eclipse...

## Linear programming in R

August 16, 2018

Linear programming is a technique for solving optimization problems whose constraints and outcome are represented by linear relationships. Simply put, linear programming lets you solve problems of the following kind: Maximize/minimize...

## Remaking ‘Luminance-gradient-dependent lightness illusion’ with R

August 16, 2018

A blog post inspired by a tweet and a YouTube video. ‘Luminance-gradient-dependent lightness illusion’: In the last few days, I’ve stumbled upon this tweet: A demo of lightness perception pic.twitter.com/BSVpgcuIw1 —...

## How to capitalize on a priori contrasts in linear (mixed) models: A tutorial

August 16, 2018

We wrote a short tutorial on contrast coding, covering the common contrast coding scenarios, among them: treatment, Helmert, ANOVA, sum, and sliding (successive differences) contrasts. The target audience is...