Articles by Max Kuhn

2022 tidymodels user survey

October 7, 2021 | Max Kuhn

We are conducting another survey to see where u...

tidymodels updates and voting!

April 27, 2020 | Max Kuhn

While I'm still supporting caret, the majority of my development effort has gone into the tidyverse modeling packages (called tidymodels). If you've never heard of this, we have just made an excellent learning resources at tidymodels.org. You might consider focusing on the Get Started pages. Another item of note: ...

Slides from R/Pharma

August 16, 2018 | Max Kuhn

My slides from the R/Pharma conference on "Modeling in the Tidyverse" are in pdf format as well as the HTML version. (Joe Cheng just killed it in his shiny presentation - see this...

R/Medicine conference

August 15, 2018 | Max Kuhn

I'll be giving a talk at the R/Medicine conference on Sept 7th in New Haven CT. My talk is on modeling in the tidyverse but there are some excellent speakers. Rob Tibshirani, Mike...

Podcast on Nonclinical Statistics

July 30, 2018 | Max Kuhn

Hugo Bowne-Anderson and I spoke about about data science in pharmaceuticals, the tidyverse, and more for the excellent DataFramed podcast from DataCamp. Listen to it here or throug...

Early draft of our “Feature Engineering and Selection” book

May 14, 2018 | Max Kuhn

Kjell and I are writing another book on predictive modeling, this time focused on all the things that you can do with predictors. It's about 60% done and we'd love to get feedback....

tidyposterior slides

May 4, 2018 | Max Kuhn

tidyposterior is an R package for comparing models based on their resampling statistics. There are a few case studies on the webpage to illustrate the process. I gave a talk at th... [Read more...]

New Workshop in Washington DC (August)

April 10, 2018 | Max Kuhn

I'll be conducting a workshop called "Applied Machine Learning" in Washington DC on August 15 and 16. The last one, at the RStudio conference, sold out quickly. The 2 day course i...

Tidy Resampling Redux with Agricultural Economics Data

March 12, 2018 | Max Kuhn

(No statistical graphs in this one. This is what my dog Artemis looks like when she wants my attention during work hours.) Mindy L. Mallory (@ace_prof) wrote a blog post on Machine...

RStudio 2018 Conference Presentation and Materials

March 4, 2018 | Max Kuhn

We've released our videos of the talks at the 2018 RStudio conference. My talk was Modeling in the Tidyverse (video) and I was also in the Tidyverse fireside chat (video). There ar...

While you wait for that to finish, can I interest you in parallel processing?

January 17, 2018 | Max Kuhn

caret has been able to utilize parallel processing for some time (before it was on CRAN in October 2007) using slightly different versions of the package. Around September of 2011, caret started using the foreach package was used to "harmonize" the par...

Lots of Package News

December 11, 2017 | Max Kuhn

I've sent a slew of packages to CRAN recently (thanks to Swetlana and Uwe). There are updates to: caret was primarily updated to deal ...

caret Cheatsheet

September 12, 2017 | Max Kuhn

It can be found on the RStudio cheatsheet page. Suggestions and pull requests are always welcome.

Nested Resampling with rsample

September 4, 2017 | Max Kuhn

A typical scheme for splitting the data when developing a predictive model is to create an initial split of the data into a training and test set. If resampling is used, it is executed on the training set where a series of binary splits is created. In rsample, we use ...

Do Resampling Estimates Have Low Correlation to the Truth? The Answer May Shock You.

April 23, 2017 | Max Kuhn

One criticism that is often leveled against using resampling methods (such as cross-validation) to measure model performance is that there is no correlation between the CV results and the true error rate. Let's look at this with some simulated data. W...

Working at RStudio

November 28, 2016 | Max Kuhn

I've joined Hadley's team at RStudio. Unsurprisingly, I’ll be working on some modeling related R packages ...

2016 UK Tour

September 26, 2016 | Max Kuhn

I'll be in the UK next week doing three talks in three days: First, I'll be giving a talk at the London R-Ladies meetup on Monday October 3rd with perhaps the best title yet: Whose Scat Is That? An 'Easil...

DataCamp Course

September 26, 2016 | Max Kuhn

Zachary Deane-Mayer, who collaborates on caret, has put together a DataCamp course on Machine Learning in R. Zach and DataCamp did a great job of developing a course that is just right for people who are ...

Boston R User Group Talk [UPDATE]

March 4, 2016 | Max Kuhn

I'll be giving a talk on Boston R user Group on Thursday March 10th at 6:00 PM. The talk will be on rule-based regression models. The image above is the training/test set split for the data that I'll be us... [Read more...]

Boston R User Group Talk [UPDATE]

March 4, 2016 | Max Kuhn

I'll be giving a talk on Boston R user Group on Thursday March 10th at 6:00 PM. The talk will be on rule-based regression models. The image above is the training/test set split for the data that I...

1 2 3 4 »

Copyright © 2025 | MH Corporate basic by MH Themes