Binary Classification from Positive Data with Skewed Confidence

Abstract

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly-supervised learning method which trains a binary classifier only from positive data equipped with confidence. However, in practice, the confidence may be skewed by bias arising in an annotation process. The Pconf classifier cannot be properly learned with skewed confidence, and consequently, the classification performance might be deteriorated. In this paper, we introduce the parameterized model of the skewed confidence, and propose the method for selecting the hyperparameter which cancels out the negative impact of the skewed confidence under the assumption that we have the misclassification rate of positive samples as a prior knowledge. We demonstrate the effectiveness of the proposed method through a synthetic experiment with simple linear models and benchmark problems with neural network models. We also apply our method to drivers’ drowsiness prediction to show that it works well with a real-world problem where confidence is obtained based on manual annotation.

Cite

Text

Shinoda et al. "Binary Classification from Positive Data with Skewed Confidence." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/460

Markdown

[Shinoda et al. "Binary Classification from Positive Data with Skewed Confidence." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/shinoda2020ijcai-binary/) doi:10.24963/IJCAI.2020/460

BibTeX

@inproceedings{shinoda2020ijcai-binary,
  title     = {{Binary Classification from Positive Data with Skewed Confidence}},
  author    = {Shinoda, Kazuhiko and Kaji, Hirotaka and Sugiyama, Masashi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {3328-3334},
  doi       = {10.24963/IJCAI.2020/460},
  url       = {https://mlanthology.org/ijcai/2020/shinoda2020ijcai-binary/}
}