Clavier, Pierre

5 publications

NeurIPS 2025 ShiQ: Bringing Back Bellman to LLMs Pierre Clavier, Nathan Grinsztajn, Raphaƫl Avalos, Yannis Flet-Berliac, Irem Ergun, Omar Darwiche Domingues, Olivier Pietquin, Pierre Harvey Richemond, Florian Strub, Matthieu Geist
ICML 2024 $\mathtt{VITS}$ : Variational Inference Thompson Sampling for Contextual Bandits Pierre Clavier, Tom Huix, Alain Oliviero Durmus
NeurIPS 2024 Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms Pierre Clavier, Laixi Shi, Erwan Le Pennec, Eric Mazumdar, Adam Wierman, Matthieu Geist
NeurIPS 2024 Time-Constrained Robust MDPs Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson
UAI 2024 Towards Minimax Optimality of Model-Based Robust Reinforcement Learning Pierre Clavier, Erwan Le Pennec, Matthieu Geist