Multi-Label Ranking from Positive and Unlabeled Data

Abstract

In this paper, we specifically examine the training of a multi-label classifier from data with incompletely assigned labels. This problem is fundamentally important in many multi-label applications because it is almost impossible for human annotators to assign a complete set of labels, although their judgments are reliable. In other words, a multi-label dataset usually has properties by which (1) assigned labels are definitely positive and (2) some labels are absent but are still considered positive. Such a setting has been studied as a positive and unlabeled (PU) classification problem in a binary setting. We treat incomplete label assignment problems as a multi-label PU ranking, which is an extension of classical binary PU problems to the well-studied rank-based multi-label classification. We derive the conditions that should be satisfied to cancel the negative effects of label incompleteness. Our experimentally obtained results demonstrate the effectiveness of these conditions.

Cite

Text

Kanehira and Harada. "Multi-Label Ranking from Positive and Unlabeled Data." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.555

Markdown

[Kanehira and Harada. "Multi-Label Ranking from Positive and Unlabeled Data." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/kanehira2016cvpr-multilabel/) doi:10.1109/CVPR.2016.555

BibTeX

@inproceedings{kanehira2016cvpr-multilabel,
  title     = {{Multi-Label Ranking from Positive and Unlabeled Data}},
  author    = {Kanehira, Atsushi and Harada, Tatsuya},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2016},
  doi       = {10.1109/CVPR.2016.555},
  url       = {https://mlanthology.org/cvpr/2016/kanehira2016cvpr-multilabel/}
}