CM3: Cooperative Multi-Goal Multi-Stage Multi-Agent Reinforcement Learning

Yang, Jiachen; Nakhaei, Alireza; Isele, David; Fujimura, Kikuo; Zha, Hongyuan

ICLR 2020

Abstract

A variety of cooperative multi-agent control problems require agents to achieve individual goals while contributing to collective success. This multi-goal multi-agent setting poses difficulties for recent algorithms, which primarily target settings with a single global reward, due to two new challenges: efficient exploration for learning both individual goal attainment and cooperation for others' success, and credit assignment for interactions between actions and goals of different agents. To address both challenges, we restructure the problem into a novel two-stage curriculum, in which single-agent goal attainment is learned prior to learning multi-agent cooperation, and we derive a new multi-goal multi-agent policy gradient with a credit function for localized credit assignment. We use a function augmentation scheme to bridge value and policy functions across the curriculum. The complete architecture, called CM3, learns significantly faster than direct adaptations of existing algorithms on three challenging multi-goal multi-agent problems: cooperative navigation in difficult formations, negotiating multi-vehicle lane changes in the SUMO traffic simulator, and strategic cooperation in a Checkers environment.
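The function augmentation idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `Stage1Net` and `AugmentedNet`, the layer sizes, and the zero-initialization of the new weights are all assumptions made for illustration. The sketch shows one plausible way a network trained on an agent's own observation in Stage 1 could be extended in Stage 2 with a new input branch for observations of other agents, such that the extended network initially reproduces the Stage 1 outputs.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def random_matrix(rows, cols, rng):
    """Small random weights; stands in for weights learned in Stage 1."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class Stage1Net:
    """Hypothetical single-agent network: own observation -> action scores."""

    def __init__(self, obs_dim, hidden_dim, n_actions, rng):
        self.W_self = random_matrix(hidden_dim, obs_dim, rng)
        self.W_out = random_matrix(n_actions, hidden_dim, rng)

    def forward(self, obs_self):
        hidden = [math.tanh(h) for h in matvec(self.W_self, obs_self)]
        return matvec(self.W_out, hidden)


class AugmentedNet:
    """Hypothetical Stage 2 network: reuses Stage 1 weights and adds a
    new branch for observations of other agents."""

    def __init__(self, stage1, others_dim):
        self.W_self = stage1.W_self  # carried over from Stage 1
        self.W_out = stage1.W_out    # carried over from Stage 1
        hidden_dim = len(self.W_self)
        # Assumption: new weights start at zero, so the new branch
        # contributes nothing until Stage 2 training updates it.
        self.W_others = [[0.0] * others_dim for _ in range(hidden_dim)]

    def forward(self, obs_self, obs_others):
        pre_self = matvec(self.W_self, obs_self)
        pre_others = matvec(self.W_others, obs_others)
        hidden = [math.tanh(a + b) for a, b in zip(pre_self, pre_others)]
        return matvec(self.W_out, hidden)


rng = random.Random(0)
stage1 = Stage1Net(obs_dim=4, hidden_dim=8, n_actions=3, rng=rng)
stage2 = AugmentedNet(stage1, others_dim=5)

obs_self = [0.1, -0.2, 0.3, 0.4]
obs_others = [1.0, 2.0, -1.0, 0.5, 0.0]

out1 = stage1.forward(obs_self)
out2 = stage2.forward(obs_self, obs_others)
```

Under these assumptions, `out1` and `out2` agree at the start of Stage 2, so single-agent behavior learned in Stage 1 is preserved while the new branch is free to learn how other agents' observations should influence the output.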

Cite

Text

Yang et al. "CM3: Cooperative Multi-Goal Multi-Stage Multi-Agent Reinforcement Learning." International Conference on Learning Representations, 2020.

Markdown

[Yang et al. "CM3: Cooperative Multi-Goal Multi-Stage Multi-Agent Reinforcement Learning." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/yang2020iclr-cm3/)

BibTeX

@inproceedings{yang2020iclr-cm3,
  title     = {{CM3: Cooperative Multi-Goal Multi-Stage Multi-Agent Reinforcement Learning}},
  author    = {Yang, Jiachen and Nakhaei, Alireza and Isele, David and Fujimura, Kikuo and Zha, Hongyuan},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/yang2020iclr-cm3/}
}