Diverse Ensembles for Active Learning

Abstract

Query by Committee is an effective approach to selective sampling in which disagreement amongst an ensemble of hypotheses is used to select data for labeling. Query by Bagging and Query by Boosting are two practical implementations of this approach that use Bagging and Boosting, respectively, to build the committees. For effective active learning, it is critical that the committee be made up of consistent hypotheses that are very different from each other. Decorate is a recently developed method that directly constructs such diverse committees using artificial training data. This paper introduces Active-Decorate, which uses Decorate committees to select good training examples. Extensive experimental results demonstrate that, in general, Active-Decorate outperforms both Query by Bagging and Query by Boosting.

Cite

Text

Melville and Mooney. "Diverse Ensembles for Active Learning." International Conference on Machine Learning, 2004. doi:10.1145/1015330.1015385

Markdown

[Melville and Mooney. "Diverse Ensembles for Active Learning." International Conference on Machine Learning, 2004.](https://mlanthology.org/icml/2004/melville2004icml-diverse/) doi:10.1145/1015330.1015385

BibTeX

@inproceedings{melville2004icml-diverse,
  title     = {{Diverse Ensembles for Active Learning}},
  author    = {Melville, Prem and Mooney, Raymond J.},
  booktitle = {International Conference on Machine Learning},
  year      = {2004},
  doi       = {10.1145/1015330.1015385},
  url       = {https://mlanthology.org/icml/2004/melville2004icml-diverse/}
}