Improving Minority Class Prediction Using Case-Specific Feature Weights
Abstract
This paper addresses the problem of handling skewed class distributions within the case-based learning (CBL) framework. We rst present as a baseline an informationgain-weighted CBL algorithm and apply it to three data sets from natural language processing (NLP) with skewed class distributions. Although overall performance of the baseline CBL algorithm is good, we show that the algorithm exhibits poor performance on minority class instances. We then present two CBL algorithms designed to improve the performance of minority class predictions. Each variation creates test-case-speci c feature weights by rst observing the path taken by the test case in a decision tree created for the learning task, and then using pathspeci c information gain values to create an appropriate weight vector for use during case retrieval. When applied to the NLP data sets, the algorithms are shown to signi cantly increase the accuracy of minority class predictions while maintaining or improving overall classi cation accuracy. 1
Cite
Text
Cardie and Nowe. "Improving Minority Class Prediction Using Case-Specific Feature Weights." International Conference on Machine Learning, 1997.Markdown
[Cardie and Nowe. "Improving Minority Class Prediction Using Case-Specific Feature Weights." International Conference on Machine Learning, 1997.](https://mlanthology.org/icml/1997/cardie1997icml-improving/)BibTeX
@inproceedings{cardie1997icml-improving,
title = {{Improving Minority Class Prediction Using Case-Specific Feature Weights}},
author = {Cardie, Claire and Nowe, Nicholas},
booktitle = {International Conference on Machine Learning},
year = {1997},
pages = {57-65},
url = {https://mlanthology.org/icml/1997/cardie1997icml-improving/}
}