Blog Archives

Example of Overfitting

November 16, 2018

I occasionally see queries on social media about overfitting: what is it, and so on. I’ll post an example here. (I mentioned it at my talk the other night on our novel approach to missing values, but had a bug in the code. Here is the correct account.) The dataset is prgeng, on wages of … Continue reading Example...

Manifold Visualization: Second Example

October 1, 2018

In last night’s post, I introduced prVis(), a new visualization tool which we have invented, available in our polyreg package. Recall that prVis() is intended as a simpler alternative to recent visualization tools like t-SNE and UMAP. Here I will post another example. The dataset is prgeng, included in the package. It consists of wage … Continue reading Manifold...

Manifold Visualization: Polynomials to the Rescue

October 1, 2018

Our arXiv paper and the associated R package polyreg caused a bit of a stir, both pro and con, when we first announced them here in June. The discussion even spread as far as Twitter, Reddit and Hacker News. We’ll be announcing a revised paper, and various new features to the package, very soon. But … Continue reading Manifold...

What, No Parentheses?

August 25, 2018

I’m about to show you an R trick. Various readers may find it cool, useful and interesting, or stupid, useless and an evil deed undermining the sanctity of R’s functional programming nature (“All bow”). But I hope many of you will find the material here rather intriguing if not useful. All this involves a trick … Continue reading What,...

Update on Polynomial Regression in Lieu of Neural Nets

July 1, 2018

There was quite a reaction to our paper, “Polynomial Regression as an Alternative to Neural Nets” (by Cheng, Khomtchouk, Matloff and Mohanty), leading to discussions/debates on Twitter, Reddit, Hacker News and so on. Accordingly, we have posted a revised version of the paper. Some of the new features: Though originally we had made the disclaimer … Continue reading Update...

Neural Networks Are Essentially Polynomial Regression

June 20, 2018

You may be interested in my new arXiv paper, joint work with Xi Cheng, an undergraduate at UC Davis (now heading to Cornell for grad school); Bohdan Khomtchouk, a postdoc in biology at Stanford; and Pete Mohanty, a Science, Engineering & Education Fellow in statistics at Stanford. The paper is of a provocative nature, … Continue reading Neural...

Women in R

June 8, 2018

Last week I gave one of the keynote addresses at R/Finance 2018 in Chicago. I considered it an honor and a pleasure to be there, both because of the stimulating intellectual exchange and the fine level of camaraderie and hospitality that prevailed. I mentioned at the start of my talk that the success of this … Continue reading Women...

Xie Yihui, R Superstar and Mensch

February 23, 2018

Yesterday a friend told me, “Yihui has written the most remarkably open blog post, and you’ve got to read it.” I did and it was. Though my post here is not about R per se, it is about a great contributor to R, our Yihui, Dr. of Statistics and (according to him) Master of Procrastination. … Continue reading Xie...

Regression Analysis — What You Should’ve Been Taught But Weren’t, and Were Taught But Shouldn’t Have Been

September 20, 2017

The above title was the title of my talk this evening at our Bay Area R Users Group. I had been asked to talk about my new book, and I presented four of the myths that are dispelled in the book. Hadley also gave an interesting talk, “An introduction to tidy evaluation,” involving some library … Continue reading Regression...

cdparcoord: Parallel Coordinates Plots for Categorical Data

September 4, 2017

My students, Vincent Yang and Harrison Nguyen, and I have developed a new data visualization package, cdparcoord, available now on CRAN. It can be viewed as an extension of the freqparcoord package written by a former grad student, Yingkang Xie and myself, which I have written about before in this blog. The idea behind both … Continue reading cdparcoord:...
