Learning Sample-Aware Threshold for Semi-Supervised Learning

Abstract

Pseudo-labeling methods are popular in semi-supervised learning (SSL). Their performance heavily relies on a proper threshold to generate hard labels for unlabeled data. To this end, most existing studies resort to a manually pre-specified function to adjust the threshold, which, however, requires prior knowledge and suffers from the scalability issue. In this paper, we propose a novel method named Meta-Threshold, which learns a dynamic confidence threshold for each unlabeled instance and does not require extra hyperparameters except a learning rate. Specifically, the instance-level confidence threshold is automatically learned by an extra network in a meta-learning manner. Considering limited labeled data as meta-data, the overall training objective of the classifier network and the meta-net can be formulated as a nested optimization problem that can be solved by a bi-level optimization scheme. Furthermore, by replacing the indicator function existed in the pseudo-labeling with a surrogate function, we theoretically provide the convergence of our training procedure, while discussing the training complexity and proposing a strategy to reduce its time cost. Extensive experiments and analyses demonstrate the effectiveness of our method on both typical and imbalanced SSL tasks.

Cite

Text

Wei et al. "Learning Sample-Aware Threshold for Semi-Supervised Learning." Machine Learning, 2024. doi:10.1007/S10994-023-06425-7

Markdown

[Wei et al. "Learning Sample-Aware Threshold for Semi-Supervised Learning." Machine Learning, 2024.](https://mlanthology.org/mlj/2024/wei2024mlj-learning/) doi:10.1007/S10994-023-06425-7

BibTeX

@article{wei2024mlj-learning,
  title     = {{Learning Sample-Aware Threshold for Semi-Supervised Learning}},
  author    = {Wei, Qi and Feng, Lei and Sun, Haoliang and Wang, Ren and He, Rundong and Yin, Yilong},
  journal   = {Machine Learning},
  year      = {2024},
  pages     = {5423-5445},
  doi       = {10.1007/S10994-023-06425-7},
  volume    = {113},
  url       = {https://mlanthology.org/mlj/2024/wei2024mlj-learning/}
}