ML Anthology
Authors
Search
About
Phan, Thomy
11 publications
AAAI
2025
Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic
Thomy Phan
,
Benran Zhang
,
Shao-Hung Chan
,
Sven Koenig
AAAI
2025
Counterfactual Online Learning for Open-Loop Monte-Carlo Planning
Thomy Phan
,
Shao-Hung Chan
,
Sven Koenig
JAIR
2025
Generative Curricula for Multi-Agent Path Finding via Unsupervised and Reinforcement Learning
Thomy Phan
,
Timy Phan
,
Sven Koenig
AAAI
2024
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search
Thomy Phan
,
Taoan Huang
,
Bistra Dilkina
,
Sven Koenig
ICML
2023
Attention-Based Recurrence for Multi-Agent Reinforcement Learning Under Stochastic Partial Observability
Thomy Phan
,
Fabian Ritz
,
Philipp Altmann
,
Maximilian Zorn
,
Jonas Nüßlein
,
Michael Kölle
,
Thomas Gabor
,
Claudia Linnhoff-Popien
IJCAI
2023
CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing
Philipp Altmann
,
Fabian Ritz
,
Leonard Feuchtinger
,
Jonas Nüßlein
,
Claudia Linnhoff-Popien
,
Thomy Phan
AAAI
2021
Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition
Thomy Phan
,
Lenz Belzner
,
Thomas Gabor
,
Andreas Sedlmeier
,
Fabian Ritz
,
Claudia Linnhoff-Popien
NeurIPS
2021
VAST: Value Function Factorization with Variable Agent Sub-Teams
Thomy Phan
,
Fabian Ritz
,
Lenz Belzner
,
Philipp Altmann
,
Thomas Gabor
,
Claudia Linnhoff-Popien
IJCAI
2019
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning
Thomy Phan
,
Thomas Gabor
,
Robert Müller
,
Christoph Roch
,
Claudia Linnhoff-Popien
AAAI
2019
Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling
Thomy Phan
,
Lenz Belzner
,
Marie Kiermeier
,
Markus Friedrich
,
Kyrill Schmid
,
Claudia Linnhoff-Popien
IJCAI
2019
Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search
Thomas Gabor
,
Jan Peter
,
Thomy Phan
,
Christian Meyer
,
Claudia Linnhoff-Popien