ML Anthology
Authors
Search
About
Thilges, Serge
2 publications
ICLR
2026
TROLL: Trust Regions Improve Reinforcement Learning for Large Language Models
Philipp Becker
,
Niklas Freymuth
,
Serge Thilges
,
Fabian Otto
,
Gerhard Neumann
ICLR
2024
Open the Black Box: Step-Based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning
Ge Li
,
Hongyi Zhou
,
Dominik Roth
,
Serge Thilges
,
Fabian Otto
,
Rudolf Lioutikov
,
Gerhard Neumann