Learning from Corrupted Binary Labels via Class-Probability Estimation
Abstract
Many supervised learning problems involve learning from samples whose labels are corrupted in some way. For example, each sample may have some constant probability of being incorrectly labelled (learning with label noise), or one may have a pool of unlabelled samples in lieu of negative samples (learning from positive and unlabelled data). This paper uses class-probability estimation to study these and other corruption processes belonging to the mutually contaminated distributions framework (Scott et al., 2013), with three conclusions. First, one can optimise balanced error and AUC without knowledge of the corruption process parameters. Second, given estimates of the corruption parameters, one can minimise a range of classification risks. Third, one can estimate the corruption parameters using only corrupted data. Experiments confirm the efficacy of class-probability estimation in learning from corrupted labels.
Cite
Text
Menon et al. "Learning from Corrupted Binary Labels via Class-Probability Estimation." International Conference on Machine Learning, 2015.Markdown
[Menon et al. "Learning from Corrupted Binary Labels via Class-Probability Estimation." International Conference on Machine Learning, 2015.](https://mlanthology.org/icml/2015/menon2015icml-learning/)BibTeX
@inproceedings{menon2015icml-learning,
title = {{Learning from Corrupted Binary Labels via Class-Probability Estimation}},
author = {Menon, Aditya and Van Rooyen, Brendan and Ong, Cheng Soon and Williamson, Bob},
booktitle = {International Conference on Machine Learning},
year = {2015},
pages = {125-134},
volume = {37},
url = {https://mlanthology.org/icml/2015/menon2015icml-learning/}
}