Estimating the Expected Error of Empirical Minimizers for Model Selection
Abstract
Model selection [e.g., 1] is considered the problem of choosing a hypothesis language which provides an optimal balance between low empirical error and high structural complexity. In this Abstract, we discuss the intuition of a new, very efficient approach to model selection. Our approach is inherently Bayesian [e.g., 2], but instead of using priors on target functions or hypotheses, we talk about priors on error values -- which leads us to a new mathematical characterization of the expected true error. In the setting of classification learning, a learner is given a sample, drawn according to an unknown distribution of labeled instances, and returns the empirical minimizer (the hypothesis with the least empirical error) which has a certain (unknown) true error. If this process is carried out repeatedly, the true error of the empirical minimizer will vary from run to run as the empirical minimizer depends on the (randomly drawn) sample. This induces a distribution of true errors of empirical minimizers, over the possible samples drawn according to the unknown distribution. If this distribution would be known, one could easily derive the expected true error of the empirical minimizer of a model by integrating over this distribution. This would immediately lead to an optimal model selection algorithm: Enumerate the models, calculate the expected error of each model by integrating over the error distribution, and select the model with the least expected error. PAC theory [3] and the VC framework provide worst-case bounds on the chance of drawing a sample such that the true error of the minimizer exceeds some " -- "worst-case" meaning that they hold for any distribution of instances and any concept in a given class. By contrast, we focus on how to determine this distributi...
Cite
Text
Scheffer and Joachims. "Estimating the Expected Error of Empirical Minimizers for Model Selection." AAAI Conference on Artificial Intelligence, 1998.Markdown
[Scheffer and Joachims. "Estimating the Expected Error of Empirical Minimizers for Model Selection." AAAI Conference on Artificial Intelligence, 1998.](https://mlanthology.org/aaai/1998/scheffer1998aaai-estimating/)BibTeX
@inproceedings{scheffer1998aaai-estimating,
title = {{Estimating the Expected Error of Empirical Minimizers for Model Selection}},
author = {Scheffer, Tobias and Joachims, Thorsten},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {1998},
pages = {1200},
url = {https://mlanthology.org/aaai/1998/scheffer1998aaai-estimating/}
}