Italian

Preparing the data for modelling with R

Detect sentinel values, recode factor variables, replace missing values: a tutorial on various steps in data preparation using R.
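As a rough illustration of the three steps named above, here is a minimal base R sketch. The toy data frame, the choice of -999 as the sentinel, and median imputation are all assumptions made for this example, not details taken from the post.

```r
# Hypothetical toy data: -999 is assumed to be a sentinel meaning "missing".
df <- data.frame(
  age    = c(34, -999, 51, 28, NA),
  gender = factor(c("m", "f", "F", "M", "f"))
)

# 1. Detect the sentinel value and turn it into a proper NA.
sentinel   <- -999
n_sentinel <- sum(df$age == sentinel, na.rm = TRUE)
df$age[which(df$age == sentinel)] <- NA

# 2. Recode the factor so that "m"/"M" and "f"/"F" collapse into two levels.
df$gender <- factor(toupper(as.character(df$gender)), levels = c("F", "M"))

# 3. Replace missing values with the median of the observed ones.
age_median <- median(df$age, na.rm = TRUE)
df$age[is.na(df$age)] <- age_median
```

Median imputation is only one of several possible strategies; whether it is appropriate depends on the data and on the model that follows.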
The post Preparing the data for modelling with R appeared first on MilanoR.

Cross-Validation for Predictive Analytics Using R

Cross-validation is a widely used model selection method. We show how to implement it in R using both raw code and the functions in the caret package.
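The "raw code" route can be sketched as a plain k-fold loop in base R. The built-in mtcars data set, the linear model mpg ~ wt + hp, the value k = 5, and the seed are assumptions chosen for illustration and are not taken from the post.

```r
set.seed(1)  # arbitrary seed, only to make the fold assignment reproducible

k <- 5
n <- nrow(mtcars)

# Assign each row to one of k folds at random, with near-equal fold sizes.
folds <- sample(rep(seq_len(k), length.out = n))

fold_rmse <- numeric(k)
for (i in seq_len(k)) {
  train_set <- mtcars[folds != i, ]   # fit on the other k - 1 folds
  test_set  <- mtcars[folds == i, ]   # evaluate on the held-out fold
  fit  <- lm(mpg ~ wt + hp, data = train_set)
  pred <- predict(fit, newdata = test_set)
  fold_rmse[i] <- sqrt(mean((test_set$mpg - pred)^2))
}

# Cross-validated error estimate: the average of the per-fold RMSE values.
cv_rmse <- mean(fold_rmse)
```

With caret, a comparable setup would typically pass trainControl(method = "cv", number = 5) to train() through its trControl argument, so the fold bookkeeping above is handled by the package.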

How to sort a list of dataframes

A method to gather data from different sources, sort it, and keep a reference to the origin of each subset, plus some efficiency considerations.
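One possible way to do this in base R is sketched below. The list element names, the columns, and the values are hypothetical, and the approach (tag each subset, bind once, then order) is an assumption about the general idea rather than the post's own method.

```r
# Hypothetical sources: names and values are made up for illustration.
sources <- list(
  shop_a = data.frame(item = c("pen", "ink"),  price = c(1.5, 7.0)),
  shop_b = data.frame(item = c("pad", "clip"), price = c(3.2, 0.4))
)

# Tag every row with the name of the list element it came from.
tagged <- Map(function(d, nm) cbind(d, origin = nm), sources, names(sources))

# Bind once with do.call() rather than growing a data frame inside a loop.
combined <- do.call(rbind, tagged)

# Sort the combined rows by price; the origin column travels with each row.
sorted <- combined[order(combined$price), ]
rownames(sorted) <- NULL
```

Binding all subsets in a single call is usually cheaper than calling rbind() repeatedly, since each repeated call copies the data accumulated so far.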

Aggregation with dplyr: summarise and summarise_each

How to apply one or many functions to one or many variables using dplyr: a practical guide to summarise() and summarise_each().
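A small sketch of both cases follows, using the built-in mtcars data as an assumed example. Note that summarise_each() has been deprecated in later dplyr releases, so the many-to-many case is written here with across() (available from dplyr 1.0.0); this is a substitution for illustration, not the syntax the post itself uses.

```r
library(dplyr)

by_cyl <- group_by(mtcars, cyl)

# One function applied to one variable, per group.
one <- summarise(by_cyl, mean_mpg = mean(mpg))

# Several functions applied to several variables, per group.
# Output columns are named <variable>_<function>, e.g. mpg_mean.
many <- summarise(
  by_cyl,
  across(c(mpg, hp), list(mean = mean, max = max))
)
```

Each result has one row per value of cyl; the second call produces one column for every variable and function pair.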

“Efficient Data Manipulation with R” Course | April 11-12 Milan

Organize your data manipulation tasks in a standard way, write clean and efficient code, and build reproducible data management processes, using the most modern R tools: tidyr, dplyr and lubridate.
