Stangel, Paul

1 publication

ICLR 2026. Rewarding Doubt: A Reinforcement Learning Approach to Calibrated Confidence Expression of Large Language Models. David Bani-Harouni, Chantal Pellegrini, Paul Stangel, Ege Özsoy, Kamilia Zaripova, Nassir Navab, Matthias Keicher