ML Anthology
Authors
Search
About
Spaan, Matthijs T. J.
37 publications
AAAI
2025
Epistemic Bellman Operators
Pascal R. van der Vaart
,
Matthijs T. J. Spaan
,
Neil Yorke-Smith
ICLR
2025
Epistemic Monte Carlo Tree Search
Yaniv Oren
,
Viliam Vadocz
,
Matthijs T. J. Spaan
,
Wendelin Boehmer
NeurIPS
2025
How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Max Weltevrede
,
Moritz Akiya Zanger
,
Matthijs T. J. Spaan
,
Wendelin Boehmer
JAIR
2025
Scaling Safe Policy Improvement: Monte Carlo Tree Search and Policy Iteration Strategies
Federico Bianchi
,
Alberto Castellini
,
Edoardo Zorzi
,
Thiago D. Simão
,
Matthijs T. J. Spaan
,
Alessandro Farinelli
ICML
2025
Trust-Region Twisted Policy Improvement
Joery A. De Vries
,
Jinke He
,
Yaniv Oren
,
Matthijs T. J. Spaan
NeurIPS
2025
Value Improved Actor Critic Algorithms
Yaniv Oren
,
Moritz Akiya Zanger
,
Pascal R. Van der Vaart
,
Mustafa Mert Çelikok
,
Wendelin Boehmer
,
Matthijs T. J. Spaan
IJCAI
2025
VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems After Change
Sterre Lutz
,
Matthijs T. J. Spaan
,
Anna Lukina
ICLR
2024
Diverse Projection Ensembles for Distributional Reinforcement Learning
Moritz Akiya Zanger
,
Wendelin Boehmer
,
Matthijs T. J. Spaan
NeurIPSW
2024
Positive Experience Reflection for Agents in Interactive Text Environments
Philip Lippmann
,
Matthijs T. J. Spaan
,
Jie Yang
ICML
2024
Scalable Safe Policy Improvement for Factored Multi-Agent MDPs
Federico Bianchi
,
Edoardo Zorzi
,
Alberto Castellini
,
Thiago D. Simão
,
Matthijs T. J. Spaan
,
Alessandro Farinelli
AAAI
2023
CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration
Qisong Yang
,
Matthijs T. J. Spaan
MLJ
2023
Safety-Constrained Reinforcement Learning with a Distributional Safety Critic
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
ICML
2023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
Alberto Castellini
,
Federico Bianchi
,
Edoardo Zorzi
,
Thiago D. Simão
,
Alessandro Farinelli
,
Matthijs T. J. Spaan
ICML
2022
Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems
Miguel Suau
,
Jinke He
,
Matthijs T. J. Spaan
,
Frans Oliehoek
JAIR
2021
Constrained Multiagent Markov Decision Processes: A Taxonomy of Problems and Algorithms
Frits de Nijs
,
Erwin Walraven
,
Mathijs Michiel de Weerdt
,
Matthijs T. J. Spaan
AAAI
2021
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
JAIR
2019
Point-Based Value Iteration for Finite-Horizon POMDPs
Erwin Walraven
,
Matthijs T. J. Spaan
AAAI
2019
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
Thiago D. Simão
,
Matthijs T. J. Spaan
IJCAI
2019
Structure Learning for Safe Policy Improvement
Thiago D. Simão
,
Matthijs T. J. Spaan
JAIR
2018
Column Generation Algorithms for Constrained POMDPs
Erwin Walraven
,
Matthijs T. J. Spaan
AAAI
2018
Preallocation and Planning Under Stochastic Resource Constraints
Frits de Nijs
,
Matthijs T. J. Spaan
,
Mathijs Michiel de Weerdt
AAAI
2017
Accelerated Vector Pruning for Optimal POMDP Solvers
Erwin Walraven
,
Matthijs T. J. Spaan
AAAI
2017
Bounding the Probability of Resource Constraint Violations in Multi-Agent MDPs
Frits de Nijs
,
Erwin Walraven
,
Mathijs Michiel de Weerdt
,
Matthijs T. J. Spaan
MLOSS
2017
The MADP Toolbox: An Open Source Library for Planning and Learning in (Multi-)Agent Systems
Frans A. Oliehoek
,
Matthijs T. J. Spaan
,
Bas Terwijn
,
Philipp Robbel
,
João V. Messias
AAAI
2016
Solving Transition-Independent Multi-Agent MDPs with Sparse Interactions
Joris Scharpff
,
Diederik M. Roijers
,
Frans A. Oliehoek
,
Matthijs T. J. Spaan
,
Mathijs Michiel de Weerdt
AAAI
2015
Best-Response Planning of Thermostatically Controlled Loads Under Power Constraints
Frits de Nijs
,
Matthijs T. J. Spaan
,
Mathijs de Weerdt
IJCAI
2015
Factored Upper Bounds for Multiagent Planning Problems Under Uncertainty with Non-Factored Value Functions
Frans Adriaan Oliehoek
,
Matthijs T. J. Spaan
,
Stefan J. Witwicki
UAI
2015
Planning Under Uncertainty with Weighted State Scenarios
Erwin Walraven
,
Matthijs T. J. Spaan
AAAI
2014
Point-Based POMDP Solving with Factored Value Function Approximation
Tiago Veiga
,
Matthijs T. J. Spaan
,
Pedro U. Lima
AAAI
2013
GSMDPs for Multi-Robot Sequential Decision-Making
João Vicente Messias
,
Matthijs T. J. Spaan
,
Pedro U. Lima
JAIR
2013
Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs
Frans A. Oliehoek
,
Matthijs T. J. Spaan
,
Christopher Amato
,
Shimon Whiteson
UAI
2012
Exploiting Structure in Cooperative Bayesian Games
Frans A. Oliehoek
,
Shimon Whiteson
,
Matthijs T. J. Spaan
AAAI
2012
Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication
Frans Adriaan Oliehoek
,
Matthijs T. J. Spaan
IJCAI
2011
Scaling up Optimal Heuristic Search in Dec-POMDPs via Incremental Expansion
Matthijs T. J. Spaan
,
Frans A. Oliehoek
,
Christopher Amato
JAIR
2008
Optimal and Approximate Q-Value Functions for Decentralized POMDPs
Frans A. Oliehoek
,
Matthijs T. J. Spaan
,
Nikos Vlassis
JMLR
2006
Point-Based Value Iteration for Continuous POMDPs
Josep M. Porta
,
Nikos Vlassis
,
Matthijs T.J. Spaan
,
Pascal Poupart
JAIR
2005
Perseus: Randomized Point-Based Value Iteration for POMDPs
Matthijs T. J. Spaan
,
Nikos Vlassis