Post-Hoc Estimators for Learning to Defer to an Expert
Abstract
Many practical settings allow a learner to defer predictions to one or more costly experts. For example, the learning to defer paradigm allows a learner to defer to a human expert, at some monetary cost. Similarly, the adaptive inference paradigm allows a base model to defer to one or more large models, at some computational cost. The goal in these settings is to learn classification and deferral mechanisms to optimise a suitable accuracy-cost tradeoff. To achieve this, a central issue studied in prior work is the design of a coherent loss function for both mechanisms. In this work, we demonstrate that existing losses have two subtle limitations: they can encourage underfitting when there is a high cost of deferring, and the deferral function can have a weak dependence on the base model predictions. To resolve these issues, we propose a post-hoc training scheme: we train a deferral function on top of a base model, with the objective of predicting to defer when the base model's error probability exceeds the cost of the expert model. This may be viewed as applying a partial surrogate to the ideal deferral loss, which can lead to a tighter approximation and thus better performance. Empirically, we verify the efficacy of post-hoc training on benchmarks for learning to defer and adaptive inference.
Cite
Text
Narasimhan et al. "Post-Hoc Estimators for Learning to Defer to an Expert." Neural Information Processing Systems, 2022.Markdown
[Narasimhan et al. "Post-Hoc Estimators for Learning to Defer to an Expert." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/narasimhan2022neurips-posthoc/)BibTeX
@inproceedings{narasimhan2022neurips-posthoc,
title = {{Post-Hoc Estimators for Learning to Defer to an Expert}},
author = {Narasimhan, Harikrishna and Jitkrittum, Wittawat and Menon, Aditya K and Rawat, Ankit and Kumar, Sanjiv},
booktitle = {Neural Information Processing Systems},
year = {2022},
url = {https://mlanthology.org/neurips/2022/narasimhan2022neurips-posthoc/}
}