AUC Optimization from Multiple Unlabeled Datasets
Abstract
Weakly supervised learning aims to make machine learning more powerful when the perfect supervision is unavailable, and has attracted much attention from researchers. Among the various scenarios of weak supervision, one of the most challenging cases is learning from multiple unlabeled (U) datasets with only a little knowledge of the class priors, or U^m learning for short. In this paper, we study the problem of building an AUC (area under ROC curve) optimal model from multiple unlabeled datasets, which maximizes the pairwise ranking ability of the classifier. We propose U^m-AUC, an AUC optimization approach that converts the U^m data into a multi-label AUC optimization problem, and can be trained efficiently. We show that the proposed U^m-AUC is effective theoretically and empirically.
Cite
Text
Xie et al. "AUC Optimization from Multiple Unlabeled Datasets." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I14.29538Markdown
[Xie et al. "AUC Optimization from Multiple Unlabeled Datasets." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/xie2024aaai-auc/) doi:10.1609/AAAI.V38I14.29538BibTeX
@inproceedings{xie2024aaai-auc,
title = {{AUC Optimization from Multiple Unlabeled Datasets}},
author = {Xie, Zheng and Liu, Yu and Li, Ming},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {16058-16066},
doi = {10.1609/AAAI.V38I14.29538},
url = {https://mlanthology.org/aaai/2024/xie2024aaai-auc/}
}