Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning
Abstract
Grouping is ubiquitous in natural systems and is essential for promoting efficiency in team coordination. This paper proposes a novel formulation of Group-oriented Multi-Agent Reinforcement Learning (GoMARL), which learns automatic grouping without domain knowledge for efficient cooperation. In contrast to existing approaches that attempt to directly learn the complex relationship between the joint action-values and individual utilities, we empower subgroups as a bridge to model the connection between small sets of agents and encourage cooperation among them, thereby improving the learning efficiency of the whole team. In particular, we factorize the joint action-values as a combination of group-wise values, which guide agents to improve their policies in a fine-grained fashion. We present an automatic grouping mechanism to generate dynamic groups and group action-values. We further introduce a hierarchical control for policy learning that drives the agents in the same group to specialize in similar policies and possess diverse strategies for various groups. Experiments on the StarCraft II micromanagement tasks and Google Research Football scenarios verify our method's effectiveness. Extensive component studies show how grouping works and enhances performance.
Cite
Text
Zang et al. "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning." Neural Information Processing Systems, 2023.Markdown
[Zang et al. "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/zang2023neurips-automatic/)BibTeX
@inproceedings{zang2023neurips-automatic,
title = {{Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning}},
author = {Zang, Yifan and He, Jinmin and Li, Kai and Fu, Haobo and Fu, Qiang and Xing, Junliang and Cheng, Jian},
booktitle = {Neural Information Processing Systems},
year = {2023},
url = {https://mlanthology.org/neurips/2023/zang2023neurips-automatic/}
}