Thilges, Serge

2 publications

ICLR 2026. TROLL: Trust Regions Improve Reinforcement Learning for Large Language Models. Philipp Becker, Niklas Freymuth, Serge Thilges, Fabian Otto, Gerhard Neumann.
ICLR 2024. Open the Black Box: Step-Based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning. Ge Li, Hongyi Zhou, Dominik Roth, Serge Thilges, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann.