Inference for the Generalization Error

Abstract

In order to to compare learning algorithms, experimental results reported in the machine learning litterature often use statistical tests of signifi(cid:173) cance. Unfortunately, most of these tests do not take into account the variability due to the choice of training set. We perform a theoretical investigation of the variance of the cross-validation estimate of the gen(cid:173) eralization error that takes into account the variability due to the choice of training sets. This allows us to propose two new ways to estimate this variance. We show, via simulations, that these new statistics perform well relative to the statistics considered by Dietterich (Dietterich, 1998).

Cite

Text

Nadeau and Bengio. "Inference for the Generalization Error." Neural Information Processing Systems, 1999.

Markdown

[Nadeau and Bengio. "Inference for the Generalization Error." Neural Information Processing Systems, 1999.](https://mlanthology.org/neurips/1999/nadeau1999neurips-inference/)

BibTeX

@inproceedings{nadeau1999neurips-inference,
  title     = {{Inference for the Generalization Error}},
  author    = {Nadeau, Claude and Bengio, Yoshua},
  booktitle = {Neural Information Processing Systems},
  year      = {1999},
  pages     = {307-313},
  url       = {https://mlanthology.org/neurips/1999/nadeau1999neurips-inference/}
}