Investigating the Distribution Assumptions in the Pac Learning Model
Abstract
In this paper, we question two assumptions of the pac learning model related to the distribution of examples: that the learning algorithm must learn for an arbitrary distribution (the distribution-independence assumption), and that the distribution of training examples is identical to the distribution of examples that the learning system sees in use (the distribution-invariance assumption). We argue that the distribution-independence assumption is too stringent, and we propose a learning model in which the distributions are required to satisfy a parametrized “reasonableness” criterion. This model allows us to extend results for learning functions under uniform distributions to non-uniform distributions. As an example, a bound is given for the time to learn an arbitrary halfspace in this model using the perceptron algorithm. We also argue that the distribution-invariance assumption in the pac model is unrealistic and we give bounds on the sample complexity when the distributions of training and testing examples differ.
Cite
Text
Bartlett and Williamson. "Investigating the Distribution Assumptions in the Pac Learning Model." Annual Conference on Computational Learning Theory, 1991. doi:10.1016/B978-1-55860-213-7.50007-9Markdown
[Bartlett and Williamson. "Investigating the Distribution Assumptions in the Pac Learning Model." Annual Conference on Computational Learning Theory, 1991.](https://mlanthology.org/colt/1991/bartlett1991colt-investigating/) doi:10.1016/B978-1-55860-213-7.50007-9BibTeX
@inproceedings{bartlett1991colt-investigating,
title = {{Investigating the Distribution Assumptions in the Pac Learning Model}},
author = {Bartlett, Peter L. and Williamson, Robert C.},
booktitle = {Annual Conference on Computational Learning Theory},
year = {1991},
pages = {24-32},
doi = {10.1016/B978-1-55860-213-7.50007-9},
url = {https://mlanthology.org/colt/1991/bartlett1991colt-investigating/}
}