Generalize Learned Heuristics to Solve Large-Scale Vehicle Routing Problems in Real-Time
Abstract
Large-scale Vehicle Routing Problems (VRPs) are widely used in logistics, transportation, supply chain, and robotic systems. Recently, data-driven VRP heuristics are proposed to generate real-time VRP solutions with up to 100 nodes. Despite this progress, current heuristics for large-scale VRPs still face three major challenges: 1) Difficulty in generalizing the heuristics learned on small-scale VRPs to large-scale VRPs without retraining; 2) Challenge in generating real-time solutions for large-scale VRPs; 3) Difficulty in embedding global constraints into learned heuristics. We contribute in the three directions: We propose a Two-stage Divide Method (TAM) to generate sub-route sequence rather than node sequence for generalizing the heuristics learned on small-scale VRPs to solve large-scale VRPs in real-time. A two-step reinforcement learning method with new reward and padding techniques is proposed to train our TAM. A global mask function is proposed to keep the global constraints satisfied when dividing a large-scale VRP into several small-scale Traveling Salesman Problems (TSPs). As result, we can solve the small-scale TSPs in parallel quickly. The experiments on synthetic and real-world large-scale VRPs show our method could generalize the learned heuristics trained on datasets of VRP 100 to solve VRPs with over 5000 nodes in real-time while keeping the solution quality better than data-driven heuristics and competitive with traditional heuristics.
Cite
Text
Hou et al. "Generalize Learned Heuristics to Solve Large-Scale Vehicle Routing Problems in Real-Time." International Conference on Learning Representations, 2023.Markdown
[Hou et al. "Generalize Learned Heuristics to Solve Large-Scale Vehicle Routing Problems in Real-Time." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/hou2023iclr-generalize/)BibTeX
@inproceedings{hou2023iclr-generalize,
title = {{Generalize Learned Heuristics to Solve Large-Scale Vehicle Routing Problems in Real-Time}},
author = {Hou, Qingchun and Yang, Jingwei and Su, Yiqiang and Wang, Xiaoqing and Deng, Yuming},
booktitle = {International Conference on Learning Representations},
year = {2023},
url = {https://mlanthology.org/iclr/2023/hou2023iclr-generalize/}
}