Gros, Sebastien

3 publications

TMLR 2026 Safe Reinforcement Learning Using Action Projection: Safeguard the Policy or the Environment? Hannah Markgraf, Shambhuraj Sawant, Hanna Krasowski, Lukas Schäfer, Sebastien Gros, Matthias Althoff
NeurIPS 2025 Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies Runze Yan, Xun Shen, Akifumi Wachi, Sebastien Gros, Anni Zhao, Xiao Hu
NeurIPS 2024 Flipping-Based Policy for Chance-Constrained Markov Decision Processes Xun Shen, Shuo Jiang, Akifumi Wachi, Kazumune Hashimoto, Sebastien Gros