Gaujal, Bruno

5 publications

COLT 2025 Logarithmic Regret of Exploration in Average Reward Markov Decision Processes Victor Boone, Bruno Gaujal

AISTATS 2023 Identification of Blackwell Optimal Policies for Deterministic MDPs Victor Boone, Bruno Gaujal

ICML 2023 The Regret of Exploration and the Control of Bad Episodes in Reinforcement Learning Victor Boone, Bruno Gaujal

TMLR 2022 Learning Algorithms for Markovian Bandits:\\Is Posterior Sampling More Scalable than Optimism? Nicolas Gast, Bruno Gaujal, Kimang Khun

NeurIPS 2022 Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space Jonatha Anselmi, Bruno Gaujal, Louis-Sébastien Rebuffi