Learning in the Presence of Inaccurate Information
Abstract
The present paper considers the effects of introducing inaccuracies in a learner's environment in Gold's learning model of identification in the limit. Three kinds of inaccuracies are considered: presence of spurious data is modeled as learning from a noisy environment, missing data is modeled as learning from incomplete environment, and the presence of a mixture of both spurious and missing data is modeled as learning from imperfect environment. Two learning domains are considered, namely, identification of programs from graphs of computable functions and identification of grammars from positive data about recursively enumerable languages. Many hierarchies and tradeoffs resulting from the interplay between the number of errors allowed in the final hypotheses, the number of inaccuracies in the data, the types of inaccuracies, and the type of success criteria are derived. An interesting result is that in the context of function learning, incomplete data is strictly worse for learning than noisy data.
Cite
Text
Fulk and Jain. "Learning in the Presence of Inaccurate Information." Annual Conference on Computational Learning Theory, 1989. doi:10.1016/0304-3975(95)00135-2Markdown
[Fulk and Jain. "Learning in the Presence of Inaccurate Information." Annual Conference on Computational Learning Theory, 1989.](https://mlanthology.org/colt/1989/fulk1989colt-learning/) doi:10.1016/0304-3975(95)00135-2BibTeX
@inproceedings{fulk1989colt-learning,
title = {{Learning in the Presence of Inaccurate Information}},
author = {Fulk, Mark A. and Jain, Sanjay},
booktitle = {Annual Conference on Computational Learning Theory},
year = {1989},
pages = {175-188},
doi = {10.1016/0304-3975(95)00135-2},
url = {https://mlanthology.org/colt/1989/fulk1989colt-learning/}
}