Learning Stochastic Functions by Smooth Simultaneous Estimation

Abstract

To learn, it suffices to estimate the error of all candidate hypotheses simultaneously. We study the problem of when this "simultaneous estimation" is possible and show that it leads to new learning procedures and weaker sufficient conditions for a broad class of learning problems. We modify the standard Probably Approximately Correct (PAC) setup to allow concepts that are "stochastic functions." A deterministic function maps a set X into a set Y, whereas a stochastic function is a probability distribution on X x Y. We approach the simultaenous estimation problem by concentrating on a subset of all estimators, those that satisfy a natural "smoothness" constraint. The common empirical estimator falls within this class. We show that smooth simultaneous estimability can be characterized by a sampling-based criterion. Also, we describe a canonical estimator for this class of problems. This canonical estimator has a unique form: it uses part of the samples to select a finite subset of hypotheses that approximates the class of candidate hypotheses, and then it uses the rest of the samples to estimate the error of each hypothesis in the subset. Finally, we show that a learning procedure based on the canonical estimator will work in every case where empirical error minimization does.

Cite

Text

Buescher and Kumar. "Learning Stochastic Functions by Smooth Simultaneous Estimation." Annual Conference on Computational Learning Theory, 1992. doi:10.1145/130385.130415

Markdown

[Buescher and Kumar. "Learning Stochastic Functions by Smooth Simultaneous Estimation." Annual Conference on Computational Learning Theory, 1992.](https://mlanthology.org/colt/1992/buescher1992colt-learning/) doi:10.1145/130385.130415

BibTeX

@inproceedings{buescher1992colt-learning,
  title     = {{Learning Stochastic Functions by Smooth Simultaneous Estimation}},
  author    = {Buescher, Kevin and Kumar, P. R.},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {1992},
  pages     = {272-279},
  doi       = {10.1145/130385.130415},
  url       = {https://mlanthology.org/colt/1992/buescher1992colt-learning/}
}