The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning
Abstract
Many real-world problems require the coordination of multiple autonomous agents. Recent work has shown the promise of Graph Neural Networks (GNNs) to learn explicit communication strategies that enable complex multi-agent coordination. These works use models of cooperative multi-agent systems whereby agents strive to achieve a shared global goal. When considering agents with self-interested local objectives, the standard design choice is to model these as separate learning systems (albeit sharing the same environment). Such a design choice, however, precludes the existence of a single, differentiable communication channel, and consequently prohibits the learning of inter-agent communication strategies. In this work, we address this gap by presenting a learning model that accommodates individual non-shared rewards and a differentiable communication channel that is common among all agents. We focus on the case where agents have self-interested objectives, and develop a learning algorithm that elicits the emergence of adversarial communications. We perform experiments on multi-agent coverage and path planning problems, and employ a post-hoc interpretability technique to visualize the messages that agents communicate to each other. We show how a single self-interested agent is capable of learning highly manipulative communication strategies that allows it to significantly outperform a cooperative team of agents.
Cite
Text
Blumenkamp and Prorok. "The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning." Conference on Robot Learning, 2020.Markdown
[Blumenkamp and Prorok. "The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning." Conference on Robot Learning, 2020.](https://mlanthology.org/corl/2020/blumenkamp2020corl-emergence/)BibTeX
@inproceedings{blumenkamp2020corl-emergence,
title = {{The Emergence of Adversarial Communication in Multi-Agent Reinforcement Learning}},
author = {Blumenkamp, Jan and Prorok, Amanda},
booktitle = {Conference on Robot Learning},
year = {2020},
pages = {1394-1414},
volume = {155},
url = {https://mlanthology.org/corl/2020/blumenkamp2020corl-emergence/}
}