Exploiting the Omission of Irrelevant Data

Abstract

Most learning algorithms work most effectively when their training data contain completely specified labeled samples. In many diagnostic tasks, however, the data will include the values of only some of the attributes; we model this as a blocking process that hides the values of those attributes from the learner. While blockers that remove the values of critical attributes can handicap a learner, this paper instead focuses on blockers that remove only irrelevant attribute values, i.e., values that are not needed to classify an instance, given the values of the other unblocked attributes. We first motivate and formalize this model of "superfluous-value blocking", and then demonstrate that these omissions can be useful, by proving that certain classes that seem hard to learn in the general PAC model --- viz., decision trees and DNF formulae --- are trivial to learn in this setting. We also show that this model can be extended to deal with (1) theory revision (i.e., modifying an existing ...

Cite

Text

Greiner et al. "Exploiting the Omission of Irrelevant Data." International Conference on Machine Learning, 1996.

Markdown

[Greiner et al. "Exploiting the Omission of Irrelevant Data." International Conference on Machine Learning, 1996.](https://mlanthology.org/icml/1996/greiner1996icml-exploiting/)

BibTeX

@inproceedings{greiner1996icml-exploiting,
  title     = {{Exploiting the Omission of Irrelevant Data}},
  author    = {Greiner, Russell and Grove, Adam J. and Kogan, Alexander},
  booktitle = {International Conference on Machine Learning},
  year      = {1996},
  pages     = {216-224},
  url       = {https://mlanthology.org/icml/1996/greiner1996icml-exploiting/}
}