The case for index-free data manipulation

December 10, 2016
By
The case for index-free data manipulation

Statisticians and data scientists want a neat world where data is arranged in a table such that every row is an observation or instance, and every column is a variable or measurement. Getting to this state of “ready to model format” (often called a denormalized form by relational algebra types) often requires quite a bit … Continue...

Read more »

Don’t give up on single trees yet…. An interactive tree with Microsoft R

December 10, 2016
By
Don’t give up on single trees yet…. An interactive tree with Microsoft R

Introduction A few days ago Microsoft announced their new Microsoft R Server 9.0 version. Among a lot of new things, it includes some new and improved machine learning algorithms...

Read more »

Grid search in the tidyverse

December 10, 2016
By
Grid search in the tidyverse

@drsimonj here to share a tidyverse method of grid search for optimizing a model’s hyperparameters.  Grid Search For anyone who’s unfamiliar with the term, grid search involves running...

Read more »

Handling Class Imbalance with R and Caret – An Introduction

December 9, 2016
By
Handling Class Imbalance with R and Caret – An Introduction

When faced with classification tasks in the real world, it can be challenging to deal with an outcome where one class heavily outweighs the other (a.k.a., imbalanced classes). The...

Read more »

Fantasy Value So Far This Year: ADP vs VOR

December 9, 2016
By

Since I am already knocked out of the playoffs (our league playoffs start in week 12), I thought it would be interesting to see which players ended up being...

Read more »

The Value of R’s Open Source Ecosystem

December 9, 2016
By

I was thrilled to be invited to speak at the Monktoberfest conference, held this past October in Portland, Maine. Not only have I been a great fan of the...

Read more »

December ’16 RStudio Tips and Tricks

December 9, 2016
By
December ’16 RStudio Tips and Tricks

by Sean Lopp Here is this month’s collection of RStudio Tips and Tricks. Thank you to those who responded to last month’s post; many of your tips are included...

Read more »

Basic Tree 1 Exercises

December 9, 2016
By
Basic Tree 1 Exercises

Using the knowledge you acquired in the previous exercises on sampling and selecting(here), we will now go through an entire data analysis process. You will be using what you...

Read more »

JHU-Coursera Data Science Specialization and MOOCs Interest

December 9, 2016
By
JHU-Coursera Data Science Specialization and MOOCs Interest

Introduction I have completed this specialization more than a year ago but I have decided to write again about it because I'm teaching Data Visualization at Universidad de Chile and...

Read more »

The Wordcloud2 library

December 9, 2016
By
The Wordcloud2 library

Read more »

Outlier detection and treatment with R

December 9, 2016
By
Outlier detection and treatment with R

Outliers in data can distort predictions and affect the accuracy, if you don’t detect and handle them appropriately especially in regression models. Why outliers treatment is important? Because, it...

Read more »

Extrapolation is tough for trees!

December 9, 2016
By
Extrapolation is tough for trees!

Out-of-sample extrapolation This post is an offshoot of some simple experiments I made to help clarify my thinking about some machine learning methods. In this experiment I fit four...

Read more »

Nomen omen

December 9, 2016
By
Nomen omen

After resisting this for way too long, I've finally decided it was time to release more widely a couple of the R packages I've...

Read more »

TOST equivalence testing R package (TOSTER) and spreadsheet

December 9, 2016
By
TOST equivalence testing R package (TOSTER) and spreadsheet

(This article was first published on The 20% Statistician, and kindly contributed to R-bloggers) I’m happy to announce my first R package ‘TOSTER’ for equivalence tests (but don’t worry,...

Read more »

Announcing pdftools 1.0

December 9, 2016
By

This week we released version 1.0 of the ropensci pdftools package to CRAN. Pdftools provides utilities for extracting text, fonts, attachments and other data from PDF files. It...

Read more »

Computing SE for PSD indices

December 9, 2016
By

A user of my Introductory Fisheries Analyses with R book recently asked me how to compute standard errors (SE) for the various PSD indices by...

Read more »

Where to Go from Here? Tips for Building Up R Experience

December 8, 2016
By
Where to Go from Here? Tips for Building Up R Experience

At the University of Utah, I teach the R lab that accompanies MATH 3070, “Applied Statistics I.”” None of my students are presumed to have any programming experience, and...

Read more »

R Consortium Projects Update

December 8, 2016
By
R Consortium Projects Update

The R Consortium has already funded 8 projects (and 3 more just in July) proposed by the R community, and the call for proposals for yet more projects is...

Read more »

Model Performance in Data Science Live Book

December 8, 2016
By
Model Performance in Data Science Live Book

An overview on error analysis through cross-validation, variance, bias, bootstrapping, accuracy and time dependent predictive models.

Read more »

Matrix Vol. 2 Exercises

Matrix Vol. 2 Exercises

Answers to the exercises are available here. Exercise 1 If M=matrix(c(1:10),nrow=5,ncol=2, dimnames=list(c('a','b','c','d','e'),c('A','B')))...

Read more »

New: Italian and German Translations of Introduction to R

December 8, 2016
By
New: Italian and German Translations of Introduction to R

The team here at DataCamp is thrilled to announce that we now offer free Italian (thanks to Quantide) and German (thanks to eoda) translations of our most popular course, Introduction...

Read more »

Crossed and Nested hierarchical models with STAN and R

December 8, 2016
By
Crossed and Nested hierarchical models with STAN and R

Below I will expand on previous posts on bayesian regression modelling using STAN (see previous instalments here, here, and here). Topic of the day is modelling crossed and nested...

Read more »

Tesseract Update: Options and Languages

December 8, 2016
By

A few weeks ago we announced the first release of the tesseract package: a high quality OCR engine in R. We have now released an update with...

Read more »

RcppAPT 0.0.3

December 7, 2016
By

A new version of RcppAPT -- our interface from R to the C++ library behind the awesome apt, apt-get, apt-cache, ... commands and their cache powering Debian, Ubuntu...

Read more »

Visualizing 15k Instagram Posts with TrelliscopeJS

December 7, 2016
By

This post shows a simple example of creating an interactive display that allows you to navigate thousands of instagram posts with just a few lines of code using TrelliscopeJS....

Read more »

flea circus

December 7, 2016
By
flea circus

An old riddle found on X validated asking for Monte Carlo resolution  but originally given on Project Euler: A 30×30 grid of squares contains 30² fleas, initially one flea...

Read more »

PISA 2015 – how to read/process/plot the data with R

December 7, 2016
By
PISA 2015 – how to read/process/plot the data with R

Yesterday OECD has published results and data from PISA 2015 study (Programme for International Student Assessment). It’s a very cool study – over 500 000 pupils (15-years old) are...

Read more »

2016-15 Automating R Demonstration Videos

December 7, 2016
By

This document describes a proof-of-concept for producing R demonstration videos in a fully-automated manner. The “script” for the video consists of a text file containing code chunks paired with...

Read more »

Microsoft R Server 9.0 now available

December 7, 2016
By
Microsoft R Server 9.0 now available

Microsoft R Server 9.0, Microsoft's R distribution with added big-data, in-database, and integration capabilities, was released today and is now available for download to MSDN subscribers. This latest release...

Read more »

Sponsors

Mango solutions



dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.