Redundant Feature Elimination for Multi-Class Problems

Abstract

We consider the problem of eliminating redundant Boolean features for a givendata set, where a feature is redundant if it separates the classes less wellthan another feature or set of features. Lavrac et al. proposed the algorithmREDUCE that works by pairwise comparison of features, i.e., it eliminates afeature if it is redundant with respect to another feature. Their algorithmoperates in an ILP setting and is restricted to two-class problems. In thispaper we improve their method and extend it to multiple classes. Central toour approach is the notion of a neighbourhood of examples: a set of examplesof the same class where the number of different features between examples isrelatively small. Redundant features are eliminated by applying a revisedversion of the REDUCE method to each pair of neighbourhoods of differentclass. We analyse the performance of our method on a range of data sets.

Cite

Text

Appice et al. "Redundant Feature Elimination for Multi-Class Problems." International Conference on Machine Learning, 2004. doi:10.1145/1015330.1015397

Markdown

[Appice et al. "Redundant Feature Elimination for Multi-Class Problems." International Conference on Machine Learning, 2004.](https://mlanthology.org/icml/2004/appice2004icml-redundant/) doi:10.1145/1015330.1015397

BibTeX

@inproceedings{appice2004icml-redundant,
  title     = {{Redundant Feature Elimination for Multi-Class Problems}},
  author    = {Appice, Annalisa and Ceci, Michelangelo and Rawles, Simon Alan and Flach, Peter A.},
  booktitle = {International Conference on Machine Learning},
  year      = {2004},
  doi       = {10.1145/1015330.1015397},
  url       = {https://mlanthology.org/icml/2004/appice2004icml-redundant/}
}