MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
Abstract
Open-ended learning methods that automatically generate a curriculum of increasingly challenging tasks serve as a promising avenue toward generally capable reinforcement learning (RL) agents. Existing methods adapt curricula independently over either environment parameters (in single-agent settings) or co-player policies (in multi-agent settings). However, the strengths and weaknesses of co-players can manifest themselves differently depending on environmental features. It is thus crucial to consider the dependency between the environment and co-player when shaping a curriculum in multi-agent domains. In this work, we use this insight and extend Unsupervised Environment Design (UED) to multi-agent environments. We then introduce Multi-Agent Environment Design Strategist for Open-Ended Learning (MAESTRO), the first multi-agent UED approach for two-player zero-sum settings. MAESTRO efficiently produces adversarial, joint curricula over both environments and co-players and attains minimax-regret guarantees at Nash equilibrium. Our experiments show that MAESTRO outperforms a number of strong baselines on competitive two-player environments, spanning discrete and continuous control.
Cite
Text
Samvelyan et al. "MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning." NeurIPS 2022 Workshops: DeepRL, 2022.Markdown
[Samvelyan et al. "MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning." NeurIPS 2022 Workshops: DeepRL, 2022.](https://mlanthology.org/neuripsw/2022/samvelyan2022neuripsw-maestro/)BibTeX
@inproceedings{samvelyan2022neuripsw-maestro,
title = {{MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning}},
author = {Samvelyan, Mikayel and Khan, Akbir and Dennis, Michael D and Jiang, Minqi and Parker-Holder, Jack and Foerster, Jakob Nicolaus and Raileanu, Roberta and Rocktäschel, Tim},
booktitle = {NeurIPS 2022 Workshops: DeepRL},
year = {2022},
url = {https://mlanthology.org/neuripsw/2022/samvelyan2022neuripsw-maestro/}
}