ML Anthology
Authors
Search
About
Simão, Thiago D.
19 publications
NeurIPS
2025
On Evaluating Policies for Robust POMDPs
Merlijn Krale
,
Eline M. Bovy
,
Maris F. L. Galesloot
,
Thiago D. Simão
,
Nils Jansen
ICLR
2025
Robust Transfer of Safety-Constrained Reinforcement Learning Agents
Markel Zubia
,
Thiago D. Simão
,
Nils Jansen
ICLR
2025
Safety-Prioritizing Curricula for Constrained Reinforcement Learning
Cevahir Koprulu
,
Thiago D. Simão
,
Nils Jansen
,
Ufuk Topcu
JAIR
2025
Scaling Safe Policy Improvement: Monte Carlo Tree Search and Policy Iteration Strategies
Federico Bianchi
,
Alberto Castellini
,
Edoardo Zorzi
,
Thiago D. Simão
,
Matthijs T. J. Spaan
,
Alessandro Farinelli
AAAI
2024
Factored Online Planning in Many-Agent POMDPs
Maris F. L. Galesloot
,
Thiago D. Simão
,
Sebastian Junges
,
Nils Jansen
AAAI
2024
Robust Active Measuring Under Model Uncertainty
Merlijn Krale
,
Thiago D. Simão
,
Jana Tumova
,
Nils Jansen
ICML
2024
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
Federico Bianchi
,
Edoardo Zorzi
,
Alberto Castellini
,
Thiago D. Simão
,
Matthijs T. J. Spaan
,
Alessandro Farinelli
IJCAI
2023
More for Less: Safe Policy Improvement with Stronger Performance Guarantees
Patrick Wienhöft
,
Marnix Suilen
,
Thiago D. Simão
,
Clemens Dubslaff
,
Christel Baier
,
Nils Jansen
IJCAI
2023
Recursive Small-Step Multi-Agent A* for Dec-POMDPs
Wietze Koops
,
Nils Jansen
,
Sebastian Junges
,
Thiago D. Simão
UAI
2023
Risk-Aware Curriculum Generation for Heavy-Tailed Task Distributions
Cevahir Koprulu
,
Thiago D. Simão
,
Nils Jansen
,
Ufuk Topcu
AAAI
2023
Safe Policy Improvement for POMDPs via Finite-State Controllers
Thiago D. Simão
,
Marnix Suilen
,
Nils Jansen
ICLR
2023
Safe Reinforcement Learning from Pixels Using a Stochastic Latent Representation
Yannick Hogewind
,
Thiago D. Simão
,
Tal Kachman
,
Nils Jansen
MLJ
2023
Safety-Constrained Reinforcement Learning with a Distributional Safety Critic
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
ICML
2023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
Alberto Castellini
,
Federico Bianchi
,
Edoardo Zorzi
,
Thiago D. Simão
,
Alessandro Farinelli
,
Matthijs T. J. Spaan
NeurIPS
2022
Robust Anytime Learning of Markov Decision Processes
Marnix Suilen
,
Thiago D. Simão
,
David B. Parker
,
Nils Jansen
AAAI
2021
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
AAAI
2019
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
Thiago D. Simão
,
Matthijs T. J. Spaan
IJCAI
2019
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments
Thiago D. Simão
IJCAI
2019
Structure Learning for Safe Policy Improvement
Thiago D. Simão
,
Matthijs T. J. Spaan