A Selective Sampling Strategy for Label Ranking

Abstract

We propose a novel active learning strategy based on the compression framework of [9] for label ranking functions which, given an input instance, predict a total order over a predefined set of alternatives. Our approach is theoretically motivated by an extension to ranking and active learning of Kääriäinen’s generalization bounds using unlabeled data [7], initially developed in the context of classification. The bounds we obtain suggest a selective sampling strategy provided that a sufficiently, yet reasonably large initial labeled dataset is provided. Experiments on Information Retrieval corpora from automatic text summarization and question/answering show that the proposed approach allows to substantially reduce the labeling effort in comparison to random and heuristic-based sampling strategies.

Cite

Text

Amini et al. "A Selective Sampling Strategy for Label Ranking." European Conference on Machine Learning, 2006. doi:10.1007/11871842_7

Markdown

[Amini et al. "A Selective Sampling Strategy for Label Ranking." European Conference on Machine Learning, 2006.](https://mlanthology.org/ecmlpkdd/2006/amini2006ecml-selective/) doi:10.1007/11871842_7

BibTeX

@inproceedings{amini2006ecml-selective,
  title     = {{A Selective Sampling Strategy for Label Ranking}},
  author    = {Amini, Massih-Reza and Usunier, Nicolas and Laviolette, François and Lacasse, Alexandre and Gallinari, Patrick},
  booktitle = {European Conference on Machine Learning},
  year      = {2006},
  pages     = {18-29},
  doi       = {10.1007/11871842_7},
  url       = {https://mlanthology.org/ecmlpkdd/2006/amini2006ecml-selective/}
}