Class-Balanced Active Learning for Image Classification

Abstract

Active learning aims to reduce the labeling effort that is required to train algorithms by learning an acquisition function selecting the most relevant data for which a label should be requested from a large unlabeled data pool. Active learning is generally studied on balanced datasets where an equal amount of images per class is available. However, real-world datasets suffer from severe imbalanced classes, the so called long-tail distribution. We argue that this further complicates the active learning process, since the imbalanced data pool can result in suboptimal classifiers. To address this problem in the context of active learning, we proposed a general optimization framework that explicitly takes class-balancing into account. Results on three datasets showed that the method is general (it can be combined with most existing active learning algorithms) and can be effectively applied to boost the performance of both informative and representative-based active learning methods. In addition, we showed that also on balanced datasets our method generally results in a performance gain.

Cite

Text

Bengar et al. "Class-Balanced Active Learning for Image Classification." Winter Conference on Applications of Computer Vision, 2022.

Markdown

[Bengar et al. "Class-Balanced Active Learning for Image Classification." Winter Conference on Applications of Computer Vision, 2022.](https://mlanthology.org/wacv/2022/bengar2022wacv-classbalanced/)

BibTeX

@inproceedings{bengar2022wacv-classbalanced,
  title     = {{Class-Balanced Active Learning for Image Classification}},
  author    = {Bengar, Javad Zolfaghari and van de Weijer, Joost and Fuentes, Laura Lopez and Raducanu, Bogdan},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2022},
  pages     = {1536-1545},
  url       = {https://mlanthology.org/wacv/2022/bengar2022wacv-classbalanced/}
}