Blog Archives

Solutions on github

November 12, 2014
By

See this page. We're not done with them all but chapter 3 and 4 are there and the regression chapters are not too far behind. The Rnw files (using knitr LaTeX) are there along with the corresponding pdf files. You may have better solutions than ...

Read more »

Some Thoughts on “Do we Need Hundreds of Classifiers to Solve Real World Classification Problems?”

November 11, 2014
By

Sorry for the blogging break. I’ve got a few planned for the next few weeks based on some work I’ve been doing. In the meantime, you should check out “Do we Need Hundreds of Classifiers to Solve Real World Classification Problems?” by Manuel Fernandez-Delgado at JMLR. They took a large number of classifiers and ran them against...

Read more »

useR! 2014 Highlights

July 3, 2014
By

My talk went well; here are the slides and a link to the paper pre-print. Hadley Wickham gave an excellent tutorial on dplyr. Based on the talk I saw, I think I will take the data sets from the book and make some public visualizations on the Plotly we...

Read more »

New caret version with adaptive resampling

May 28, 2014
By
New caret version with adaptive resampling

A new version of caret is on CRAN now. There are a number of bug fixes: A man page with the list of models available via train was added back into the package. See ?models. Thoralf Mildenberger found and fixed a bug in the variable importance calcu...

Read more »

A Tutorial and Talk at useR! 2014 [Important Update]

May 12, 2014
By

See the update below I'll be doing a morning tutorial at useR! at the end of June in Los Angeles. I've done this same presentation at the last few conferences and this will probably be the last time for this specific workshop. The tutorial outline is: Conventions in R Data splitting and estimating performance Data pre-processing Over-fitting and resampling Training and tuning tree...

Read more »

A Tutorial and Talk at useR! 2014

May 7, 2014
By

I'll be doing a morning tutorial at useR! at the end of June in Los Angeles. I've done this same presentation at the last few conferences and this will probably be the last time for this specific workshop. I will be including a copy of the book for ...

Read more »

Bay Area RUG Talk on 3/17

March 9, 2014
By

I'm making my yearly pilgrimage to San Fransico to teach at PAW. I'll also be giving a short talk at the Bay Area R Users Group on model tags in the caret package and the code that produced this interactive plot. It is at 7:00 PM on Monday March 17...

Read more »

caret webinar materials

February 28, 2014
By

The webinar was recorded (thanks to Ray DiGiacomo and the Orange County RUG). The slides are here minus a few typos. 

Read more »

Optimizing Probability Thresholds for Class Imbalances

February 6, 2014
By
Optimizing Probability Thresholds for Class Imbalances

One of the toughest problems in predictive model occurs when the classes have a severe imbalance. We spend an entire chapter on this subject itself. One consequence of this is that the performance is generally very biased against the class with the smallest frequencies. For example, if the data have a majority of samples belonging to the first...

Read more »

caret webinar on Feb 25

February 2, 2014
By

I"ll be doing a webinar with the Orange County R User Group on the caret package on Tue, Feb 25, 2014 1:00 PM - 2:00 PM EST.Here is the url in case you are interested: https://www3.gotomeeting.com/register/673845982Thanks to Ray DiGiacom...

Read more »

Sponsors

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)