ML Anthology
Scherrer, Bruno
27 publications
NeurIPS
2025
AlphaBeta Is Not as Good as You Think: A Simple Class of Synthetic Games for a Better Analysis of Deterministic Game-Solving Algorithms
Raphael Boige
,
Amine Boumaza
,
Bruno Scherrer
NeurIPS
2020
Leverage the Average: An Analysis of KL Regularization in Reinforcement Learning
Nino Vieillard
,
Tadashi Kozuno
,
Bruno Scherrer
,
Olivier Pietquin
,
Remi Munos
,
Matthieu Geist
AISTATS
2020
Momentum in Reinforcement Learning
Nino Vieillard
,
Bruno Scherrer
,
Olivier Pietquin
,
Matthieu Geist
ICML
2019
A Theory of Regularized Markov Decision Processes
Matthieu Geist
,
Bruno Scherrer
,
Olivier Pietquin
AAAI
2019
How to Combine Tree-Search Methods in Reinforcement Learning
Yonathan Efroni
,
Gal Dalal
,
Bruno Scherrer
,
Shie Mannor
ICML
2018
Beyond the One-Step Greedy Approach in Reinforcement Learning
Yonathan Efroni
,
Gal Dalal
,
Bruno Scherrer
,
Shie Mannor
NeurIPS
2018
Multiple-Step Greedy Policies in Approximate and Online Reinforcement Learning
Yonathan Efroni
,
Gal Dalal
,
Bruno Scherrer
,
Shie Mannor
AISTATS
2016
On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games
Julien Pérolat
,
Bilal Piot
,
Bruno Scherrer
,
Olivier Pietquin
ICML
2016
Softened Approximate Policy Iteration for Markov Games
Julien Pérolat
,
Bilal Piot
,
Matthieu Geist
,
Bruno Scherrer
,
Olivier Pietquin
ICML
2015
Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games
Julien Pérolat
,
Bruno Scherrer
,
Bilal Piot
,
Olivier Pietquin
JMLR
2015
Approximate Modified Policy Iteration and Its Application to the Game of Tetris
Bruno Scherrer
,
Mohammad Ghavamzadeh
,
Victor Gabillon
,
Boris Lesner
,
Matthieu Geist
ICML
2015
Non-Stationary Approximate Modified Policy Iteration
Boris Lesner
,
Bruno Scherrer
ICML
2015
On the Rate of Convergence and Error Bounds for LSTD(λ)
Manel Tagorti
,
Bruno Scherrer
ICML
2014
Approximate Policy Iteration Schemes: A Comparison
Bruno Scherrer
ECML-PKDD
2014
Local Policy Search in a Convex Space and Conservative Policy Iteration as Boosted Policy Search
Bruno Scherrer
,
Matthieu Geist
JMLR
2014
Off-Policy Learning with Eligibility Traces: A Survey
Matthieu Geist
,
Bruno Scherrer
NeurIPS
2013
Approximate Dynamic Programming Finally Performs Well in the Game of Tetris
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Bruno Scherrer
NeurIPS
2013
Improved and Generalized Upper Bounds on the Complexity of Policy Iteration
Bruno Scherrer
JMLR
2013
Performance Bounds for λ Policy Iteration and Application to the Game of Tetris
Bruno Scherrer
ICML
2012
A Dantzig Selector Approach to Temporal Difference Learning
Matthieu Geist
,
Bruno Scherrer
,
Alessandro Lazaric
,
Mohammad Ghavamzadeh
ICML
2012
Approximate Modified Policy Iteration
Bruno Scherrer
,
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Matthieu Geist
NeurIPS
2012
On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Markov Decision Processes
Bruno Scherrer
,
Boris Lesner
ICML
2011
Classification-Based Policy Iteration with a Critic
Victor Gabillon
,
Alessandro Lazaric
,
Mohammad Ghavamzadeh
,
Bruno Scherrer
ICML
2010
Least-Squares Policy Iteration: Bias-Variance Trade-Off in Control Problems
Christophe Thiery
,
Bruno Scherrer
ICML
2010
Should One Compute the Temporal Difference Fix Point or Minimize the Bellman Residual? The Unified Oblique Projection View
Bruno Scherrer
NeurIPS
2008
Biasing Approximate Dynamic Programming with a Lower Discount Factor
Marek Petrik
,
Bruno Scherrer
IJCAI
2003
Modular Self-Organization for a Long-Living Autonomous Agent
Bruno Scherrer