Rebeschini, Patrick
29 publications
NeurIPS
2025
On the Necessity of Adaptive Regularisation: Optimal Anytime Online Learning on $\boldsymbol{\ell_p}$-Balls
ICLR
2024
Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
NeurIPS
2023
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence
NeurIPS
2023
Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes