Logit Mixing Training for More Reliable and Accurate Prediction

Abstract

When a person solves a multiple-choice problem, she considers not only which choice is the answer but also which choices are not. By knowing which choices are wrong and exploiting the relationships between choices, she can improve her prediction accuracy. Inspired by this human reasoning process, we propose LogitMix, a new training strategy that fully utilizes inter-class relationships. Our strategy is combined with recent data augmentation techniques, e.g., Mixup, Manifold Mixup, CutMix, and PuzzleMix, and uses a mixed logit, i.e., a mixture of two logits, as an auxiliary training objective. Since the logit can preserve both positive and negative inter-class relationships, it can encourage a network to learn the probabilities of wrong answers correctly. Our extensive experimental results on image- and language-based tasks demonstrate that LogitMix achieves state-of-the-art performance among recent data augmentation techniques in terms of calibration error and prediction accuracy.
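The idea of a mixed logit as an auxiliary objective can be illustrated with a small sketch. The abstract does not give the exact loss, so everything below is an assumption made for illustration: the inputs are mixed Mixup-style with a coefficient `lam`, the auxiliary term is taken to be a mean squared error between the logits of the mixed input and the same convex combination of the two clean logits, and the names `logit_mixing_loss`, `aux_weight`, and `linear_logits` are hypothetical. How gradients flow through the target logits is also left unspecified here.

```python
import math

def linear_logits(weights, x):
    """Toy classifier (hypothetical): one weight row per class, logits = W @ x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def mix(a, b, lam):
    """Elementwise convex combination lam * a + (1 - lam) * b."""
    return [lam * ai + (1.0 - lam) * bi for ai, bi in zip(a, b)]

def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example, computed stably."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_z - logits[label]

def logit_mixing_loss(model, x1, y1, x2, y2, lam, aux_weight=1.0):
    """Mixup cross-entropy plus an ASSUMED MSE penalty on mixed logits.

    Returns (total, mixup_ce, aux) so the two terms can be inspected.
    """
    # Logits of the Mixup-style mixed input.
    mixed_logits = model(mix(x1, x2, lam))
    # The "mixed logit": the same mixture applied to the two clean logits.
    target_logits = mix(model(x1), model(x2), lam)
    # Standard Mixup objective on the mixed input.
    ce = (lam * cross_entropy(mixed_logits, y1)
          + (1.0 - lam) * cross_entropy(mixed_logits, y2))
    # Assumed auxiliary term: mean squared error between the two logit vectors.
    aux = sum((p - t) ** 2 for p, t in zip(mixed_logits, target_logits))
    aux /= len(mixed_logits)
    return ce + aux_weight * aux, ce, aux
```

For a purely linear model the auxiliary term vanishes, because mixing the inputs and mixing the logits commute; for a nonlinear network it is generally nonzero, which is where such a term would add a training signal beyond the label-mixing loss.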

Cite

Text

Bang et al. "Logit Mixing Training for More Reliable and Accurate Prediction." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/390

Markdown

[Bang et al. "Logit Mixing Training for More Reliable and Accurate Prediction." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/bang2022ijcai-logit/) doi:10.24963/IJCAI.2022/390

BibTeX

@inproceedings{bang2022ijcai-logit,
  title     = {{Logit Mixing Training for More Reliable and Accurate Prediction}},
  author    = {Bang, Duhyeon and Baek, Kyungjune and Kim, Jiwoo and Jeon, Yunho and Kim, Jin-Hwa and Kim, Jiwon and Lee, Jongwuk and Shim, Hyunjung},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {2812--2819},
  doi       = {10.24963/IJCAI.2022/390},
  url       = {https://mlanthology.org/ijcai/2022/bang2022ijcai-logit/}
}