Generality-Training of a Classifier for Improved Calibration in Unseen Contexts

Abstract

Artificial neural networks tend to output class probabilities that are miscalibrated, i . e ., their reported uncertainty is not a very good indicator of how much we should trust the model. Consequently, methods have been developed to improve the model’s predictive uncertainty, both during training and post-hoc. Even if the model is calibrated on the domain used in training, it typically becomes over-confident when applied on slightly different target domains, e.g. due to perturbations or shifts in the data. The model can be recalibrated for a fixed list of target domains, but its performance can still be poor on unseen target domains. To address this issue, we propose a generality-training procedure that learns a modified head for the neural network to achieve better calibration generalization to new domains while retaining calibration performance on the given domains. This generality-head is trained on multiple domains using a new objective function with increased emphasis on the calibration loss compared to cross-entropy. Such training results in a more general model in the sense of not only better calibration but also better accuracy on unseen domains, as we demonstrate experimentally on multiple datasets. The code and supplementary for the paper is available ( https://github.com/bsl-traveller/CaliGen.git ).

Cite

Text

Leelar and Kull. "Generality-Training of a Classifier for Improved Calibration in Unseen Contexts." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023. doi:10.1007/978-3-031-43424-2_23

Markdown

[Leelar and Kull. "Generality-Training of a Classifier for Improved Calibration in Unseen Contexts." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023.](https://mlanthology.org/ecmlpkdd/2023/leelar2023ecmlpkdd-generalitytraining/) doi:10.1007/978-3-031-43424-2_23

BibTeX

@inproceedings{leelar2023ecmlpkdd-generalitytraining,
  title     = {{Generality-Training of a Classifier for Improved Calibration in Unseen Contexts}},
  author    = {Leelar, Bhawani Shankar and Kull, Meelis},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2023},
  pages     = {374-391},
  doi       = {10.1007/978-3-031-43424-2_23},
  url       = {https://mlanthology.org/ecmlpkdd/2023/leelar2023ecmlpkdd-generalitytraining/}
}