Automatic Sample-by-Sample Model Selection Between Two Off-the-Shelf Classifiers

Chadwick, Steve P.

AAAI 1999, p. 958

Abstract

If one could predict which of two classifiers will correctly classify a particular sample, then one could use the better classifier. Continuing this selection process throughout the data set should result in improved accuracy over either classifier alone. Fortunately, scalar measures which relate to the degree of confidence that we have in a classification can be computed for most common classifiers (Hastie & Tibshirani 1996). Some examples of confidence measures are distance from a linear discriminant separating plane (Duda & Hart 1973), distance to the nearest neighbor, distance to the nearest unlike neighbor, and distance to the center of correctly classified training data.

We propose to apply discriminant analysis to the confidence measures, producing a rule which determines when one classifier is expected to be more accurate than the other. Let q1(x) and q2(x) be scalar functions for the confidence measures of two off-the-shelf classifiers. Each sample, x_i, is mapped to (q1(x_i), q2(x_i)) in the decision space for selecting a classifier; thus the decision space has only two dimensions. Observe that the sample space has d dimensions, where d is the number of features in the sample. In this respect the dimensionality of selecting the classifier is reduced from d to 2.

In order to select the better classifier, we need an estimate of where each classifier succeeds or fails. Both classifiers are applied to each training sample to create this estimate. Classifiers which never misclassify a training sample, such as nearest neighbors, are evaluated by leave-one-out runs. Each training sample now has two confidence values, one from each confidence function. It is also known whether each classifier has correctly classified each of the training samples. This classification information is used to associate a value selected from {−1, 0, 1} with each training sample. This value is termed correctness.
$$
\text{correctness} =
\begin{cases}
1, & \text{if first is best;} \\
0, & \text{if both the same;} \\
-1, & \text{if second is best.}
\end{cases}
$$

If the first classifier is correct and the second classifier
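
The labelling and selection steps described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's implementation. The abstract says only that discriminant analysis is applied to the confidence measures, so Fisher's linear discriminant is used here as one plausible choice. Samples with correctness 0 are left out of the fit, which is also an assumption, since the abstract is cut off before saying how they are treated. The names `correctness`, `fit_selector`, `select_first`, and the `ridge` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (q1(x_i), q2(x_i)) in the 2-D decision space


def correctness(first_correct: bool, second_correct: bool) -> int:
    """Return 1 if only the first classifier is right, -1 if only the
    second is right, and 0 if both are right or both are wrong."""
    return int(first_correct) - int(second_correct)


def _mean(points: Sequence[Point]) -> Point:
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def _scatter(points: Sequence[Point], mean: Point) -> Tuple[float, float, float]:
    """Entries (sxx, sxy, syy) of the 2x2 within-class scatter matrix."""
    sxx = sxy = syy = 0.0
    for x, y in points:
        dx, dy = x - mean[0], y - mean[1]
        sxx += dx * dx
        sxy += dx * dy
        syy += dy * dy
    return sxx, sxy, syy


def fit_selector(
    decision_points: Sequence[Point],
    labels: Sequence[int],
    ridge: float = 1e-9,  # assumed small regulariser to keep the matrix invertible
) -> Tuple[Point, float]:
    """Fit a Fisher linear discriminant separating 'first is best' (+1)
    from 'second is best' (-1). Ties (0) are ignored: an assumption."""
    first: List[Point] = [p for p, c in zip(decision_points, labels) if c == 1]
    second: List[Point] = [p for p, c in zip(decision_points, labels) if c == -1]
    if not first or not second:
        raise ValueError("need at least one sample with each of the labels 1 and -1")

    m1, m2 = _mean(first), _mean(second)
    a, b = _scatter(first, m1), _scatter(second, m2)
    sxx, sxy, syy = a[0] + b[0] + ridge, a[1] + b[1], a[2] + b[2] + ridge
    det = sxx * syy - sxy * sxy

    # w = S_w^{-1} (m1 - m2), using the closed-form inverse of a 2x2 matrix.
    dx, dy = m1[0] - m2[0], m1[1] - m2[1]
    w = ((syy * dx - sxy * dy) / det, (sxx * dy - sxy * dx) / det)

    # Place the boundary halfway between the two class means.
    mid = ((m1[0] + m2[0]) / 2, (m1[1] + m2[1]) / 2)
    bias = -(w[0] * mid[0] + w[1] * mid[1])
    return w, bias


def select_first(w: Point, bias: float, q1: float, q2: float) -> bool:
    """True if the rule prefers the first classifier for this sample."""
    return w[0] * q1 + w[1] * q2 + bias > 0
```

In use, each training sample would contribute one point (q1(x_i), q2(x_i)) and one correctness label, with leave-one-out runs supplying the outcomes for classifiers that never misclassify their own training data. The sketch does not assume that a larger confidence value means higher confidence; the fitted direction `w` absorbs the sign, which matters for distance-based measures where smaller is better.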

Cite

Text

Chadwick. "Automatic Sample-by-Sample Model Selection Between Two Off-the-Shelf Classifiers." AAAI Conference on Artificial Intelligence, 1999.

Markdown

[Chadwick. "Automatic Sample-by-Sample Model Selection Between Two Off-the-Shelf Classifiers." AAAI Conference on Artificial Intelligence, 1999.](https://mlanthology.org/aaai/1999/chadwick1999aaai-automatic/)

BibTeX

@inproceedings{chadwick1999aaai-automatic,
  title     = {{Automatic Sample-by-Sample Model Selection Between Two Off-the-Shelf Classifiers}},
  author    = {Chadwick, Steve P.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {1999},
  pages     = {958},
  url       = {https://mlanthology.org/aaai/1999/chadwick1999aaai-automatic/}
}