Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification

Abstract

The binary cross-entropy (BCE) loss function is widely utilized in multi-label classification (MLC) tasks, treating each label independently. The log-sum-exp pairwise (LSEP) loss, which emphasizes higher logits for positive classes over negative ones within a sample and accounts for label dependencies, has demonstrated effectiveness for MLC. However, our experiments suggest that its performance in long-tailed multi-label classification (LTMLC) appears to be inferior to that of BCE. In this study, we investigate the impact of the log-sum-exp operation on recognition and explore optimization avenues. Our observations reveal two primary shortcomings of LSEP that lead to its poor performance in LTMLC: 1) the indiscriminate use of label dependencies without consideration of the distribution shift between training and test sets, and 2) the overconfidence in negative labels with features similar to those of positive labels. To mitigate these problems, we propose a distributionally robust loss (DR), which includes class-wise LSEP and a negative gradient constraint. Additionally, our findings indicate that the BCE-based loss is somewhat complementary to the LSEP-based loss, offering enhanced performance upon integration. Extensive experiments conducted on two LTMLC datasets, VOC-LT and COCO-LT, demonstrate the consistent effectiveness of our proposed method. Code: https://github.com/ Kunmonkey/DR-Loss.

Cite

Text

Lin et al. "Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73414-4_24

Markdown

[Lin et al. "Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/lin2024eccv-distributionally/) doi:10.1007/978-3-031-73414-4_24

BibTeX

@inproceedings{lin2024eccv-distributionally,
  title     = {{Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification}},
  author    = {Lin, Dekun and Cui, Zhe and Chen, Rui and Peng, Tailai and Xie, Xinran and Qin, Xiaolin},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73414-4_24},
  url       = {https://mlanthology.org/eccv/2024/lin2024eccv-distributionally/}
}