Simão, Thiago D.

19 publications

NeurIPS 2025 On Evaluating Policies for Robust POMDPs Merlijn Krale, Eline M. Bovy, Maris F. L. Galesloot, Thiago D. Simão, Nils Jansen
ICLR 2025 Robust Transfer of Safety-Constrained Reinforcement Learning Agents Markel Zubia, Thiago D. Simão, Nils Jansen
ICLR 2025 Safety-Prioritizing Curricula for Constrained Reinforcement Learning Cevahir Koprulu, Thiago D. Simão, Nils Jansen, Ufuk Topcu
JAIR 2025 Scaling Safe Policy Improvement: Monte Carlo Tree Search and Policy Iteration Strategies Federico Bianchi, Alberto Castellini, Edoardo Zorzi, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli
AAAI 2024 Factored Online Planning in Many-Agent POMDPs Maris F. L. Galesloot, Thiago D. Simão, Sebastian Junges, Nils Jansen
AAAI 2024 Robust Active Measuring Under Model Uncertainty Merlijn Krale, Thiago D. Simão, Jana Tumova, Nils Jansen
ICML 2024 Scalable Safe Policy Improvement for Factored Multi-Agent MDPs Federico Bianchi, Edoardo Zorzi, Alberto Castellini, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli
IJCAI 2023 More for Less: Safe Policy Improvement with Stronger Performance Guarantees Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen
IJCAI 2023 Recursive Small-Step Multi-Agent A* for Dec-POMDPs Wietze Koops, Nils Jansen, Sebastian Junges, Thiago D. Simão
UAI 2023 Risk-Aware Curriculum Generation for Heavy-Tailed Task Distributions Cevahir Koprulu, Thiago D. Simão, Nils Jansen, Ufuk Topcu
AAAI 2023 Safe Policy Improvement for POMDPs via Finite-State Controllers Thiago D. Simão, Marnix Suilen, Nils Jansen
ICLR 2023 Safe Reinforcement Learning from Pixels Using a Stochastic Latent Representation Yannick Hogewind, Thiago D. Simão, Tal Kachman, Nils Jansen
MLJ 2023 Safety-Constrained Reinforcement Learning with a Distributional Safety Critic Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan
ICML 2023 Scalable Safe Policy Improvement via Monte Carlo Tree Search Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T. J. Spaan
NeurIPS 2022 Robust Anytime Learning of Markov Decision Processes Marnix Suilen, Thiago D. Simão, David B. Parker, Nils Jansen
AAAI 2021 WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan
AAAI 2019 Safe Policy Improvement with Baseline Bootstrapping in Factored Environments Thiago D. Simão, Matthijs T. J. Spaan
IJCAI 2019 Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments Thiago D. Simão
IJCAI 2019 Structure Learning for Safe Policy Improvement Thiago D. Simão, Matthijs T. J. Spaan