CAP: Conformalized Abstention Policies for Context-Adaptive Risk Management for LLMs and VLMs

Abstract

Large Language and Vision-Language Models (LLMs/VLMs) are increasingly deployed in high-stakes domains where predictive failures can be costly. Conformal Prediction (CP) offers distribution-free uncertainty quantification with finite-sample coverage guarantees, but its reliance on a globally fixed risk level enforces a uniform trade-off between coverage and informativeness, misaligned with the instance-specific uncertainty patterns of modern foundation models. We propose the framework of Conformalized Abstention Policy (CAP), a novel framework that integrates CP with deep Reinforcement Learning (RL) to learn per-instance abstention policies. CAP trains a utility-driven policy to dynamically select the conformal risk level for each input, balancing point prediction, set prediction, and full abstention based on downstream utility. We specifically introduce Policy-Calibrated Coverage, a theoretical guarantee ensuring that the empirical coverage of the learned policy reliably estimates its true expected performance. Extensive experiments show that CAP maintains the 90% target coverage while substantially outperforming static CP baselines: improving hallucination detection AUROC by up to 22.2%, uncertainty-guided selective generation AUARC by 21.2%, and reducing calibration error by over 70%. CAP also extends to free-form generation by managing the trade-off between a detailed and factual response on a per-instance basis by learning an optimal risk level for sub-claim retention.

Cite

Text

Tayebati et al. "CAP: Conformalized Abstention Policies for Context-Adaptive Risk Management for LLMs and VLMs." Proceedings of the 17th Asian Conference on Machine Learning, 2025.

Markdown

[Tayebati et al. "CAP: Conformalized Abstention Policies for Context-Adaptive Risk Management for LLMs and VLMs." Proceedings of the 17th Asian Conference on Machine Learning, 2025.](https://mlanthology.org/acml/2025/tayebati2025acml-cap/)

BibTeX

@inproceedings{tayebati2025acml-cap,
  title     = {{CAP: Conformalized Abstention Policies for Context-Adaptive Risk Management for LLMs and VLMs}},
  author    = {Tayebati, Sina and Kumar, Divake and Darabi, Nastaran and Jayasuriya, Dinithi and Tulabandhula, Theja and Krishnan, Ranganath and Trivedi, Amit Ranjan},
  booktitle = {Proceedings of the 17th Asian Conference on Machine Learning},
  year      = {2025},
  pages     = {926-941},
  volume    = {304},
  url       = {https://mlanthology.org/acml/2025/tayebati2025acml-cap/}
}