MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments

Abstract

Evaluating embodied multi-agent planners necessitates robust and versatile benchmarks. We introduce MAP-THOR (Multi-Agent Planning in AI2-THOR), a benchmark specifically designed to assess the performance of embodied multi-agent planning systems in realistic, partially observable environments within the AI2-THOR environment. Existing benchmarks offer extensive environments for single-agent tasks, but fail to capture the complexities inherent in multi-agent interactions, non-stationarity, partial observability and long-horizon planning. Addressing these gaps, MAP-THOR facilitates the development of frameworks that allocate tasks and enable coordination among multiple agents. MAP-THOR introduces a comprehensive suite of household tasks demanding collaboration and adaptation to dynamic environmental changes, mirroring real-world scenarios. Our benchmark includes detailed metrics for success rate, efficiency, and collaborative effectiveness, setting a new standard for evaluating multi-agent planning systems. Through rigorous experiments, we show that MAP-THOR offers a robust evaluation framework for language model (LM)-based multi-agent planning systems. Ultimately, we hope that MAP-THOR serves as a standard benchmark to identify embodied multi-agent planning frameworks that systematically improve generalization for long-horizon partially observable planning.

Cite

Text

Nayak et al. "MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments." ICML 2024 Workshops: MFM-EAI, 2024.

Markdown

[Nayak et al. "MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments." ICML 2024 Workshops: MFM-EAI, 2024.](https://mlanthology.org/icmlw/2024/nayak2024icmlw-mapthor/)

BibTeX

@inproceedings{nayak2024icmlw-mapthor,
  title     = {{MAP-THOR: Benchmarking Long-Horizon Multi-Agent Planning Frameworks in Partially Observable Environments}},
  author    = {Nayak, Siddharth and Orozco, Adelmo Morrison and Ten Have, Marina and Thirumalai, Vittal and Zhang, Jackson and Chen, Darren and Kapoor, Aditya and Robinson, Eric and Gopalakrishnan, Karthik and Ichter, Brian and Harrison, James and Mahajan, Anuj and Balakrishnan, Hamsa},
  booktitle = {ICML 2024 Workshops: MFM-EAI},
  year      = {2024},
  url       = {https://mlanthology.org/icmlw/2024/nayak2024icmlw-mapthor/}
}