# Articles by Wicked Good Data - r

### Handling Class Imbalance with R and Caret – Caveats when using the AUC

January 2, 2017 |

In my last post, I went over how weighting and sampling methods can help to improve predictive performance in the case of imbalanced classes. I also included an applied example with a simulated dataset that used the area under the ROC curve (AUC) as th... [Read more...]

### Handling Class Imbalance with R and Caret – Caveats when using the AUC

January 2, 2017 |

In my last post, I went over how weighting and sampling methods can help to improve predictive performance in the case of imbalanced classes. I also included an applied example with a simulated dataset that used the area under the ROC curve (AUC) as th... [Read more...]

### Handling Class Imbalance with R and Caret – An Introduction

December 9, 2016 |

When faced with classification tasks in the real world, it can be challenging to deal with an outcome where one class heavily outweighs the other (a.k.a., imbalanced classes). The following will be a two-part post on some of the techniques that can hel... [Read more...]

### Handling Class Imbalance with R and Caret – An Introduction

December 9, 2016 |

When faced with classification tasks in the real world, it can be challenging to deal with an outcome where one class heavily outweighs the other (a.k.a., imbalanced classes). The following will be a two-part post on some of the techniques that can hel... [Read more...]

### Clustering Mixed Data Types in R

June 21, 2016 |

Clustering allows us to better understand how a sample might be comprised of distinct subgroups given a set of variables. While many introductions to cluster analysis typically review a simple application using continuous variables, clustering data of mixed types (e.g., continuous, ordinal, and nominal) is often of interest. The ... [Read more...]

### Clustering Mixed Data Types in R

June 21, 2016 |

Clustering allows us to better understand how a sample might be comprised of distinct subgroups given a set of variables. While many introductions to cluster analysis typically review a simple application using continuous variables, clustering data of ... [Read more...]

### Partial Dependence Plots

December 22, 2014 |

It can be difficult to understand the functional relations between predictors and an outcome when using black box prediction methods like random forests. One way to investigate these relations is with partial dependence plots. These plots are graphical visualizations of the marginal effect of a given variable (or multiple variables) ... [Read more...]

# Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts.(You will not see this message again.)

Click here to close (This popup will not appear again)