MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning

Zhou, Ming; Wan, Ziyu; Wang, Hanjing; Wen, Muning; Wu, Runzhe; Wen, Ying; Yang, Yaodong; Yu, Yong; Wang, Jun; Zhang, Weinan

MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning

Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang

MLOSS 2023 pp. 1-12

/mloss/2023/zhou2023jmlr-malib/

Abstract

Population-based multi-agent reinforcement learning (PB-MARL) encompasses a range of methods that merge dynamic population selection with multi-agent reinforcement learning algorithms (MARL). While PB-MARL has demonstrated notable achievements in complex multi-agent tasks, its sequential execution is plagued by low computational efficiency due to the diversity in computing patterns and policy combinations. We propose a solution involving a stateless central task dispatcher and stateful workers to handle PB-MARL's subroutines, thereby capitalizing on parallelism across various components for efficient problem-solving. In line with this approach, we introduce MALib, a parallel framework that incorporates a task control model, independent data servers, and an abstraction of MARL training paradigms. The framework has undergone extensive testing and is available under the MIT license (https://github.com/sjtu-marl/malib)

PDF MLOSS Code Semantic Scholar

Cite

Text

Zhou et al. "MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning." Machine Learning Open Source Software, 2023.

Markdown

[Zhou et al. "MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning." Machine Learning Open Source Software, 2023.](https://mlanthology.org/mloss/2023/zhou2023jmlr-malib/)

BibTeX

@article{zhou2023jmlr-malib,
  title     = {{MALib: A Parallel Framework for Population-Based Multi-Agent Reinforcement Learning}},
  author    = {Zhou, Ming and Wan, Ziyu and Wang, Hanjing and Wen, Muning and Wu, Runzhe and Wen, Ying and Yang, Yaodong and Yu, Yong and Wang, Jun and Zhang, Weinan},
  journal   = {Machine Learning Open Source Software},
  year      = {2023},
  pages     = {1-12},
  volume    = {24},
  url       = {https://mlanthology.org/mloss/2023/zhou2023jmlr-malib/}
}