Online Active Learning with Surrogate Loss Functions
Abstract
We derive a novel active learning algorithm in the streaming setting for binary classification tasks. The algorithm leverages weak labels to minimize the number of label requests, and trains a model to optimize a surrogate loss on a resulting set of labeled and weak-labeled points. Our algorithm jointly admits two crucial properties: theoretical guarantees in the general agnostic setting and a strong empirical performance. Our theoretical analysis shows that the algorithm attains favorable generalization and label complexity bounds, while our empirical study on 18 real-world datasets demonstrate that the algorithm outperforms standard baselines, including the Margin Algorithm, or Uncertainty Sampling, a high-performing active learning algorithm favored by practitioners.
Cite
Text
DeSalvo et al. "Online Active Learning with Surrogate Loss Functions." Neural Information Processing Systems, 2021.Markdown
[DeSalvo et al. "Online Active Learning with Surrogate Loss Functions." Neural Information Processing Systems, 2021.](https://mlanthology.org/neurips/2021/desalvo2021neurips-online/)BibTeX
@inproceedings{desalvo2021neurips-online,
title = {{Online Active Learning with Surrogate Loss Functions}},
author = {DeSalvo, Giulia and Gentile, Claudio and Thune, Tobias Sommer},
booktitle = {Neural Information Processing Systems},
year = {2021},
url = {https://mlanthology.org/neurips/2021/desalvo2021neurips-online/}
}