Estimating the Accuracy of Learned Concepts
Abstract
This paper investigates alternative estimators of the accuracy of concepts learned from examples. In particular, the cross-validation and 632 bootstrap estimators are studied, using synthetic training data and the foil learning algorithm. Our experimental results contradict previous papers in statistics, which advocate the 632 bootstrap method as superior to crossvalidation. Nevertheless, our results also suggest that conclusions based on cross-validation in previous machine learning papers are unreliable. Specifically, our observations are that (i) the true error of the concept learned by foil from independently drawn sets of examples of the same concept varies widely, (ii) the estimate of true error provided by cross-validation has high variability but is approximately unbiased, and (iii) the 632 bootstrap estimator has lower variability than cross-validation, but is systematically biased. 1 Introduction The problem of concept induction (also known as the classification problem [ K...
Cite
Text
Bailey and Elkan. "Estimating the Accuracy of Learned Concepts." International Joint Conference on Artificial Intelligence, 1993.Markdown
[Bailey and Elkan. "Estimating the Accuracy of Learned Concepts." International Joint Conference on Artificial Intelligence, 1993.](https://mlanthology.org/ijcai/1993/bailey1993ijcai-estimating/)BibTeX
@inproceedings{bailey1993ijcai-estimating,
title = {{Estimating the Accuracy of Learned Concepts}},
author = {Bailey, Timothy L. and Elkan, Charles},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {1993},
pages = {895-901},
url = {https://mlanthology.org/ijcai/1993/bailey1993ijcai-estimating/}
}