Multi-Class Ensemble-Based Active Learning

Abstract

Ensemble-based active learning has been proven to efficiently reduce the number of training instances and thus the cost of data acquisition. To determine the utility of a candidate training instance, the disagreement about its class value among the ensemble members is used. While the disagreement for binary classification is easily determined using margins, the adaption to multi-class problems is not straightforward and little studied in the literature. In this paper we consider four approaches to measure ensemble disagreement, including margins, uncertainty sampling and entropy, and evaluate them empirically on various ensemble strategies for active learning. We show that margins outperform the other disagreement measures on three of four active learning strategies. Our experiments also show that some active learning strategies are more sensitive to the choice of disagreement measure than others.

Cite

Text

Körner and Wrobel. "Multi-Class Ensemble-Based Active Learning." European Conference on Machine Learning, 2006. doi:10.1007/11871842_68

Markdown

[Körner and Wrobel. "Multi-Class Ensemble-Based Active Learning." European Conference on Machine Learning, 2006.](https://mlanthology.org/ecmlpkdd/2006/korner2006ecml-multiclass/) doi:10.1007/11871842_68

BibTeX

@inproceedings{korner2006ecml-multiclass,
  title     = {{Multi-Class Ensemble-Based Active Learning}},
  author    = {Körner, Christine and Wrobel, Stefan},
  booktitle = {European Conference on Machine Learning},
  year      = {2006},
  pages     = {687-694},
  doi       = {10.1007/11871842_68},
  url       = {https://mlanthology.org/ecmlpkdd/2006/korner2006ecml-multiclass/}
}