Learning Efficient Multi-Agent Cooperative Visual Exploration
Abstract
We tackle the problem of cooperative visual exploration where multiple agents need to jointly explore unseen regions as fast as possible based on visual signals. Classical planning-based methods often suffer from expensive computation overhead at each step and a limited expressiveness of complex cooperation strategy. By contrast, reinforcement learning (RL) has recently become a popular paradigm for tackling this challenge due to its modeling capability of arbitrarily complex strategies and minimal inference overhead. In this paper, we propose a novel RL-based multi-agent planning module, Multi-agent Spatial Planner (MSP). MSP leverages a transformer-based architecture, Spatial-TeamFormer, which effectively captures spatial relations and intra-agent interactions via hierarchical spatial self-attentions. In addition, we also implement a few multi-agent enhancements to process local information from each agent for an aligned spatial representation and more precise planning. Finally, we perform policy distillation to extract a meta policy to significantly improve the generalization capability of final policy. We call this overall solution, Multi-Agent Active Neural SLAM (MAANS). MAANS substantially outperforms classical planning-based baselines for the first time in a photo-realistic 3D simulator, Habitat. Code and videos can be found at https://sites.google.com/view/maans.
Cite
Text
Yu et al. "Learning Efficient Multi-Agent Cooperative Visual Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19842-7_29Markdown
[Yu et al. "Learning Efficient Multi-Agent Cooperative Visual Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/yu2022eccv-learning/) doi:10.1007/978-3-031-19842-7_29BibTeX
@inproceedings{yu2022eccv-learning,
title = {{Learning Efficient Multi-Agent Cooperative Visual Exploration}},
author = {Yu, Chao and Yang, Xinyi and Gao, Jiaxuan and Yang, Huazhong and Wang, Yu and Wu, Yi},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19842-7_29},
url = {https://mlanthology.org/eccv/2022/yu2022eccv-learning/}
}