Phan, Thomy

11 publications

AAAI 2025. Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic. Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig
AAAI 2025. Counterfactual Online Learning for Open-Loop Monte-Carlo Planning. Thomy Phan, Shao-Hung Chan, Sven Koenig
JAIR 2025. Generative Curricula for Multi-Agent Path Finding via Unsupervised and Reinforcement Learning. Thomy Phan, Timy Phan, Sven Koenig
AAAI 2024. Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search. Thomy Phan, Taoan Huang, Bistra Dilkina, Sven Koenig
ICML 2023. Attention-Based Recurrence for Multi-Agent Reinforcement Learning Under Stochastic Partial Observability. Thomy Phan, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien
IJCAI 2023. CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing. Philipp Altmann, Fabian Ritz, Leonard Feuchtinger, Jonas Nüßlein, Claudia Linnhoff-Popien, Thomy Phan
AAAI 2021. Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition. Thomy Phan, Lenz Belzner, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Claudia Linnhoff-Popien
NeurIPS 2021. VAST: Value Function Factorization with Variable Agent Sub-Teams. Thomy Phan, Fabian Ritz, Lenz Belzner, Philipp Altmann, Thomas Gabor, Claudia Linnhoff-Popien
IJCAI 2019. Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning. Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien
AAAI 2019. Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling. Thomy Phan, Lenz Belzner, Marie Kiermeier, Markus Friedrich, Kyrill Schmid, Claudia Linnhoff-Popien
IJCAI 2019. Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search. Thomas Gabor, Jan Peter, Thomy Phan, Christian Meyer, Claudia Linnhoff-Popien