Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding
Abstract
We study the problem of optimizing a guidance policy capable of dynamically guiding the agents for lifelong Multi-Agent Path Finding based on real-time traffic patterns. Multi-Agent Path Finding (MAPF) focuses on moving multiple agents from their starts to goals without collisions. Its lifelong variant, LMAPF, continuously assigns new goals to agents. In this work, we focus on improving the solution quality of PIBT, a state-of-the-art rule-based LMAPF algorithm, by optimizing a policy to generate adaptive guidance. We design two pipelines to incorporate guidance in PIBT in two different ways. We demonstrate the superiority of the optimized policy over both static guidance and human-designed policies. Additionally, we explore scenarios where task distribution changes over time, a challenging yet common situation in real-world applications that is rarely explored in the literature.
Cite
Text
Zang et al. "Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I14.33614Markdown
[Zang et al. "Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zang2025aaai-online/) doi:10.1609/AAAI.V39I14.33614BibTeX
@inproceedings{zang2025aaai-online,
title = {{Online Guidance Graph Optimization for Lifelong Multi-Agent Path Finding}},
author = {Zang, Hongzhi and Zhang, Yulun and Jiang, He and Chen, Zhe and Harabor, Daniel and Stuckey, Peter J. and Li, Jiaoyang},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {14726-14735},
doi = {10.1609/AAAI.V39I14.33614},
url = {https://mlanthology.org/aaai/2025/zang2025aaai-online/}
}