Spaan, Matthijs T. J.

37 publications

AAAI 2025 Epistemic Bellman Operators Pascal R. van der Vaart, Matthijs T. J. Spaan, Neil Yorke-Smith
ICLR 2025 Epistemic Monte Carlo Tree Search Yaniv Oren, Viliam Vadocz, Matthijs T. J. Spaan, Wendelin Boehmer
NeurIPS 2025 How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning Max Weltevrede, Moritz Akiya Zanger, Matthijs T. J. Spaan, Wendelin Boehmer
JAIR 2025 Scaling Safe Policy Improvement: Monte Carlo Tree Search and Policy Iteration Strategies Federico Bianchi, Alberto Castellini, Edoardo Zorzi, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli
ICML 2025 Trust-Region Twisted Policy Improvement Joery A. de Vries, Jinke He, Yaniv Oren, Matthijs T. J. Spaan
NeurIPS 2025 Value Improved Actor Critic Algorithms Yaniv Oren, Moritz Akiya Zanger, Pascal R. van der Vaart, Mustafa Mert Çelikok, Wendelin Boehmer, Matthijs T. J. Spaan
IJCAI 2025 VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems After Change Sterre Lutz, Matthijs T. J. Spaan, Anna Lukina
ICLR 2024 Diverse Projection Ensembles for Distributional Reinforcement Learning Moritz Akiya Zanger, Wendelin Boehmer, Matthijs T. J. Spaan
NeurIPSW 2024 Positive Experience Reflection for Agents in Interactive Text Environments Philip Lippmann, Matthijs T. J. Spaan, Jie Yang
ICML 2024 Scalable Safe Policy Improvement for Factored Multi-Agent MDPs Federico Bianchi, Edoardo Zorzi, Alberto Castellini, Thiago D. Simão, Matthijs T. J. Spaan, Alessandro Farinelli
AAAI 2023 CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration Qisong Yang, Matthijs T. J. Spaan
MLJ 2023 Safety-Constrained Reinforcement Learning with a Distributional Safety Critic Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan
ICML 2023 Scalable Safe Policy Improvement via Monte Carlo Tree Search Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T. J. Spaan
ICML 2022 Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans Oliehoek
JAIR 2021 Constrained Multiagent Markov Decision Processes: A Taxonomy of Problems and Algorithms Frits de Nijs, Erwin Walraven, Mathijs Michiel de Weerdt, Matthijs T. J. Spaan
AAAI 2021 WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan
JAIR 2019 Point-Based Value Iteration for Finite-Horizon POMDPs Erwin Walraven, Matthijs T. J. Spaan
AAAI 2019 Safe Policy Improvement with Baseline Bootstrapping in Factored Environments Thiago D. Simão, Matthijs T. J. Spaan
IJCAI 2019 Structure Learning for Safe Policy Improvement Thiago D. Simão, Matthijs T. J. Spaan
JAIR 2018 Column Generation Algorithms for Constrained POMDPs Erwin Walraven, Matthijs T. J. Spaan
AAAI 2018 Preallocation and Planning Under Stochastic Resource Constraints Frits de Nijs, Matthijs T. J. Spaan, Mathijs Michiel de Weerdt
AAAI 2017 Accelerated Vector Pruning for Optimal POMDP Solvers Erwin Walraven, Matthijs T. J. Spaan
AAAI 2017 Bounding the Probability of Resource Constraint Violations in Multi-Agent MDPs Frits de Nijs, Erwin Walraven, Mathijs Michiel de Weerdt, Matthijs T. J. Spaan
MLOSS 2017 The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems Frans A. Oliehoek, Matthijs T. J. Spaan, Bas Terwijn, Philipp Robbel, João V. Messias
AAAI 2016 Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions Joris Scharpff, Diederik M. Roijers, Frans A. Oliehoek, Matthijs T. J. Spaan, Mathijs Michiel de Weerdt
AAAI 2015 Best-Response Planning of Thermostatically Controlled Loads Under Power Constraints Frits de Nijs, Matthijs T. J. Spaan, Mathijs de Weerdt
IJCAI 2015 Factored Upper Bounds for Multiagent Planning Problems Under Uncertainty with Non-Factored Value Functions Frans Adriaan Oliehoek, Matthijs T. J. Spaan, Stefan J. Witwicki
UAI 2015 Planning Under Uncertainty with Weighted State Scenarios Erwin Walraven, Matthijs T. J. Spaan
AAAI 2014 Point-Based POMDP Solving with Factored Value Function Approximation Tiago Veiga, Matthijs T. J. Spaan, Pedro U. Lima
AAAI 2013 GSMDPs for Multi-Robot Sequential Decision-Making João Vicente Messias, Matthijs T. J. Spaan, Pedro U. Lima
JAIR 2013 Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs Frans A. Oliehoek, Matthijs T. J. Spaan, Christopher Amato, Shimon Whiteson
UAI 2012 Exploiting Structure in Cooperative Bayesian Games Frans A. Oliehoek, Shimon Whiteson, Matthijs T. J. Spaan
AAAI 2012 Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication Frans Adriaan Oliehoek, Matthijs T. J. Spaan
IJCAI 2011 Scaling up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion Matthijs T. J. Spaan, Frans A. Oliehoek, Christopher Amato
JAIR 2008 Optimal and Approximate Q-Value Functions for Decentralized POMDPs Frans A. Oliehoek, Matthijs T. J. Spaan, Nikos Vlassis
JMLR 2006 Point-Based Value Iteration for Continuous POMDPs Josep M. Porta, Nikos Vlassis, Matthijs T. J. Spaan, Pascal Poupart
JAIR 2005 Perseus: Randomized Point-Based Value Iteration for POMDPs Matthijs T. J. Spaan, Nikos Vlassis