Articles by Gary Hutson

TidyModels or CARET – how they compare?

October 7, 2021 | Gary Hutson

People have been telling me not to teach caret anymore, although I do teach TidyModels as well. Their argument is that TidyModels is new and shiny and we should be using the most up-to-date tools. I get this justification, but is caret really dead? My answer is “not ...
[Read more...]

Common mistakes we Data Scientists make

September 17, 2021 | Gary Hutson

DISCLAIMER: I am a data scientist and have made all these mistakes, but I have had the privilege of sitting on the managerial, project lead and developer sides of the fence, and here are some tips for getting your stakeholders (i.e. anyone involved in the project team or has ...
[Read more...]

ConfusionTableR has made it to CRAN

July 21, 2021 | Gary Hutson

In my dusty GitHub repository, sitting there, was a gem of a tool for tidying the outputs of a machine learning classification model into a record and row-level view for storage in databases. It has taken me time to get this to CRAN, as the dreaded closure, not subsettable, plagues ...
[Read more...]

OddsPlotty has landed on CRAN

June 22, 2021 | Gary Hutson

I am so excited that my package OddsPlotty has landed on CRAN. This was a package I worked on when I was doing lots of multiple comparisons of logistic regression models and wanted a way to visualise the odds ratios on a graph, i.e. an odds plot. This package ...
[Read more...]

DTPlyr – easier data.table for DPLYR users

June 8, 2021 | Gary Hutson

Do you program in R and normally use DPLYR for data wrangling, manipulation or whatever term you call it? Have you heard all the hype about data.table and how this package can significantly improve the run time of your R scripts? Have you been meaning to get round ...
[Read more...]

NHSDataDictionaRy is back on CRAN

May 6, 2021 | Gary Hutson

The NHSDataDictionaRy package is now back on CRAN, and I am pleased as punch. This update contains the OpenSafely scraper to get data from the website for lookups developed by Ben Goldacre’s team. Why did it disappear? The package disappeared due to me taking it down for major script ...
[Read more...]

ConfusionTableR package has a new function

April 6, 2021 | Gary Hutson

The ConfusionTableR package has a new function. Welcome to var_impeR, which takes a trained caret R model and produces a tibble and a supporting variable importance plot. How to use the new var_impeR function: the following code shows how to use the new function. Training a CARET model ...
[Read more...]