Ernst, Damien

17 publications

ICML 2025 A Theoretical Justification for Asymmetric Actor-Critic Algorithms Gaspard Lambrechts, Damien Ernst, Aditya Mahajan
NeurIPSW 2024 Cost Estimation in Unit Commitment Problems Using Simulation-Based Inference Matthias Pirlet, Adrien Bolland, Gilles Louppe, Damien Ernst
NeurIPS 2023 IMP-MARL: A Suite of Environments for Large-Scale Infrastructure Management Planning via MARL Pascal Leroy, Pablo G. Morato, Jonathan Pisane, Athanasios Kolios, Damien Ernst
ICMLW 2023 Informed POMDP: Leveraging Additional Information in Model-Based RL Gaspard Lambrechts, Adrien Bolland, Damien Ernst
TMLR 2023 Policy Gradient Algorithms Implicitly Optimize by Continuation Adrien Bolland, Gilles Louppe, Damien Ernst
ICMLW 2023 Policy Gradient Algorithms Implicitly Optimize by Continuation Adrien Bolland, Gilles Louppe, Damien Ernst
JAIR 2022 Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent Adrien Bolland, Ioannis Boukas, Mathias Berger, Damien Ernst
TMLR 2022 Recurrent Networks, Hidden States and Beliefs in Partially Observable Environments Gaspard Lambrechts, Adrien Bolland, Damien Ernst
NeurIPSW 2022 Value-Based CTDE Methods in Symmetric Two-Team Markov Game: From Cooperation to Team Competition Pascal Leroy, Jonathan Pisane, Damien Ernst
MLJ 2021 A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding Ioannis Boukas, Damien Ernst, Thibaut Théate, Adrien Bolland, Alexandre Huynen, Martin Buchwald, Christelle Wynants, Bertrand Cornélusse
IJCAI 2020 On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract) Vincent François-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau
JAIR 2019 On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability Vincent François-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau
JMLR 2013 Optimal Discovery with Probabilistic Expert Advice: Finite Time Analysis and Macroscopic Optimality Sébastien Bubeck, Damien Ernst, Aurélien Garivier
AISTATS 2010 Model-Free Monte Carlo-like Policy Evaluation Raphael Fonteneau, Susan Murphy, Louis Wehenkel, Damien Ernst
MLJ 2006 Extremely Randomized Trees Pierre Geurts, Damien Ernst, Louis Wehenkel
JMLR 2005 Tree-Based Batch Mode Reinforcement Learning Damien Ernst, Pierre Geurts, Louis Wehenkel
ECML-PKDD 2003 Iteratively Extending Time Horizon Reinforcement Learning Damien Ernst, Pierre Geurts, Louis Wehenkel