Scherrer, Bruno

27 publications

NeurIPS 2025 AlphaBeta Is Not as Good as You Think: A Simple Class of Synthetic Games for a Better Analysis of Deterministic Game-Solving Algorithms Raphael Boige, Amine Boumaza, Bruno Scherrer
NeurIPS 2020 Leverage the Average: An Analysis of KL Regularization in Reinforcement Learning Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Remi Munos, Matthieu Geist
AISTATS 2020 Momentum in Reinforcement Learning Nino Vieillard, Bruno Scherrer, Olivier Pietquin, Matthieu Geist
ICML 2019 A Theory of Regularized Markov Decision Processes Matthieu Geist, Bruno Scherrer, Olivier Pietquin
AAAI 2019 How to Combine Tree-Search Methods in Reinforcement Learning Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor
ICML 2018 Beyond the One-Step Greedy Approach in Reinforcement Learning Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor
NeurIPS 2018 Multiple-Step Greedy Policies in Approximate and Online Reinforcement Learning Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor
AISTATS 2016 On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games Julien Pérolat, Bilal Piot, Bruno Scherrer, Olivier Pietquin
ICML 2016 Softened Approximate Policy Iteration for Markov Games Julien Pérolat, Bilal Piot, Matthieu Geist, Bruno Scherrer, Olivier Pietquin
ICML 2015 Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games Julien Pérolat, Bruno Scherrer, Bilal Piot, Olivier Pietquin
JMLR 2015 Approximate Modified Policy Iteration and Its Application to the Game of Tetris Bruno Scherrer, Mohammad Ghavamzadeh, Victor Gabillon, Boris Lesner, Matthieu Geist
ICML 2015 Non-Stationary Approximate Modified Policy Iteration Boris Lesner, Bruno Scherrer
ICML 2015 On the Rate of Convergence and Error Bounds for LSTD(λ) Manel Tagorti, Bruno Scherrer
ICML 2014 Approximate Policy Iteration Schemes: A Comparison Bruno Scherrer
ECML-PKDD 2014 Local Policy Search in a Convex Space and Conservative Policy Iteration as Boosted Policy Search Bruno Scherrer, Matthieu Geist
JMLR 2014 Off-Policy Learning with Eligibility Traces: A Survey Matthieu Geist, Bruno Scherrer
NeurIPS 2013 Approximate Dynamic Programming Finally Performs Well in the Game of Tetris Victor Gabillon, Mohammad Ghavamzadeh, Bruno Scherrer
NeurIPS 2013 Improved and Generalized Upper Bounds on the Complexity of Policy Iteration Bruno Scherrer
JMLR 2013 Performance Bounds for λ Policy Iteration and Application to the Game of Tetris Bruno Scherrer
ICML 2012 A Dantzig Selector Approach to Temporal Difference Learning Matthieu Geist, Bruno Scherrer, Alessandro Lazaric, Mohammad Ghavamzadeh
ICML 2012 Approximate Modified Policy Iteration Bruno Scherrer, Victor Gabillon, Mohammad Ghavamzadeh, Matthieu Geist
NeurIPS 2012 On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes Bruno Scherrer, Boris Lesner
ICML 2011 Classification-Based Policy Iteration with a Critic Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Bruno Scherrer
ICML 2010 Least-Squares Policy Iteration: Bias-Variance Trade-Off in Control Problems Christophe Thiery, Bruno Scherrer
ICML 2010 Should One Compute the Temporal Difference Fix Point or Minimize the Bellman Residual? The Unified Oblique Projection View Bruno Scherrer
NeurIPS 2008 Biasing Approximate Dynamic Programming with a Lower Discount Factor Marek Petrik, Bruno Scherrer
IJCAI 2003 Modular Self-Organization for a Long-Living Autonomous Agent Bruno Scherrer