Context-Aware Sparse Deep Coordination Graphs

Abstract

Learning sparse coordination graphs adaptive to the coordination dynamics among agents is a long-standing problem in cooperative multi-agent learning. This paper studies this problem and proposes a novel method using the variance of payoff functions to construct context-aware sparse coordination topologies. We theoretically consolidate our method by proving that the smaller the variance of payoff functions is, the less likely action selection will change after removing the corresponding edge. Moreover, we propose to learn action representations to effectively reduce the influence of payoff functions' estimation errors on graph construction. To empirically evaluate our method, we present the Multi-Agent COordination (MACO) benchmark by collecting classic coordination problems in the literature, increasing their difficulty, and classifying them into different types. We carry out a case study and experiments on the MACO and StarCraft II micromanagement benchmark to demonstrate the dynamics of sparse graph learning, the influence of graph sparseness, and the learning performance of our method.

Cite

Text

Wang et al. "Context-Aware Sparse Deep Coordination Graphs." International Conference on Learning Representations, 2022.

Markdown

[Wang et al. "Context-Aware Sparse Deep Coordination Graphs." International Conference on Learning Representations, 2022.](https://mlanthology.org/iclr/2022/wang2022iclr-contextaware/)

BibTeX

@inproceedings{wang2022iclr-contextaware,
  title     = {{Context-Aware Sparse Deep Coordination Graphs}},
  author    = {Wang, Tonghan and Zeng, Liang and Dong, Weijun and Yang, Qianlan and Yu, Yang and Zhang, Chongjie},
  booktitle = {International Conference on Learning Representations},
  year      = {2022},
  url       = {https://mlanthology.org/iclr/2022/wang2022iclr-contextaware/}
}