Online Active Learning with Surrogate Loss Functions

Abstract

We derive a novel active learning algorithm in the streaming setting for binary classification tasks. The algorithm leverages weak labels to minimize the number of label requests, and trains a model to optimize a surrogate loss on a resulting set of labeled and weak-labeled points. Our algorithm jointly admits two crucial properties: theoretical guarantees in the general agnostic setting and a strong empirical performance. Our theoretical analysis shows that the algorithm attains favorable generalization and label complexity bounds, while our empirical study on 18 real-world datasets demonstrate that the algorithm outperforms standard baselines, including the Margin Algorithm, or Uncertainty Sampling, a high-performing active learning algorithm favored by practitioners.

Cite

Text

DeSalvo et al. "Online Active Learning with Surrogate Loss Functions." Neural Information Processing Systems, 2021.

Markdown

[DeSalvo et al. "Online Active Learning with Surrogate Loss Functions." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/desalvo2021neurips-online/)

BibTeX

@inproceedings{desalvo2021neurips-online,
  title     = {{Online Active Learning with Surrogate Loss Functions}},
  author    = {DeSalvo, Giulia and Gentile, Claudio and Thune, Tobias Sommer},
  booktitle = {Neural Information Processing Systems},
  year      = {2021},
  url       = {https://mlanthology.org/neurips/2021/desalvo2021neurips-online/}
}