Semi-Supervised Regression with Heteroscedastic Pseudo-Labels
Abstract
Pseudo-labeling is a commonly used paradigm in semi-supervised learning, yet its application to semi-supervised regression (SSR) remains relatively under-explored. Unlike classification, where pseudo-labels are discrete and confidence-based filtering is effective, SSR involves continuous outputs with heteroscedastic noise, making it challenging to assess pseudo-label reliability. As a result, naive pseudo-labeling can lead to error accumulation and overfitting to incorrect labels. To address this, we propose an uncertainty-aware pseudo-labeling framework that dynamically adjusts pseudo-label influence from a bi-level optimization perspective. By jointly minimizing empirical risk over all data and optimizing uncertainty estimates to enhance generalization on labeled data, our method effectively mitigates the impact of unreliable pseudo-labels. We provide theoretical insights and extensive experiments to validate our approach across various benchmark SSR datasets, and the results demonstrate superior robustness and performance compared to existing methods.
Cite
Text
Sun et al. "Semi-Supervised Regression with Heteroscedastic Pseudo-Labels." Advances in Neural Information Processing Systems, 2025.Markdown
[Sun et al. "Semi-Supervised Regression with Heteroscedastic Pseudo-Labels." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/sun2025neurips-semisupervised/)BibTeX
@inproceedings{sun2025neurips-semisupervised,
title = {{Semi-Supervised Regression with Heteroscedastic Pseudo-Labels}},
author = {Sun, Xueqing and Wang, Renzhen and Wang, Quanziang and Wu, Yichen and Jia, Xixi and Meng, Deyu},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/sun2025neurips-semisupervised/}
}