MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems

Abstract

Cooperative Pickup and Delivery Problem (PDP), as a variant of the typical Vehicle Routing Problems (VRP), is an important formulation in many real-world applications, such as on-demand delivery, industrial warehousing, etc. It is of great importance to efficiently provide high-quality solutions of cooperative PDP. However, it is not trivial to provide effective solutions directly due to two major challenges: 1) the structural dependency between pickup and delivery pairs require explicit modeling and representation. 2) the cooperation between different vehicles is highly related to the solution exploration and difficult to model. In this paper, we propose a novel multi-agent reinforcement learning based framework to solve the cooperative PDP (MAPDP). First, we design a paired context embedding to well measure the dependency of different nodes considering their structural limits. Second, we utilize cooperative multi-agent decoders to leverage the decision dependence among different vehicle agents based on a special communication embedding. Third, we design a novel cooperative A2C algorithm to train the integrated model. We conduct extensive experiments on a randomly generated dataset and a real-world dataset. Experiments result shown that the proposed MAPDP outperform all other baselines by at least 1.64\% in all settings, and shows significant computation speed during solution inference.

Cite

Text

Zong et al. "MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I9.21236

Markdown

[Zong et al. "MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/zong2022aaai-mapdp/) doi:10.1609/AAAI.V36I9.21236

BibTeX

@inproceedings{zong2022aaai-mapdp,
  title     = {{MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems}},
  author    = {Zong, Zefang and Zheng, Meng and Li, Yong and Jin, Depeng},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {9980-9988},
  doi       = {10.1609/AAAI.V36I9.21236},
  url       = {https://mlanthology.org/aaai/2022/zong2022aaai-mapdp/}
}