MTL-KD: Multi-Task Learning via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver
Abstract
Multi-Task Learning (MTL) in Neural Combinatorial Optimization (NCO) is a promising approach for training a unified model capable of solving multiple Vehicle Routing Problem (VRP) variants. However, existing Reinforcement Learning (RL)-based multi-task methods can only train light decoder models on small-scale problems, exhibiting limited generalization ability when solving large-scale problems. To overcome this limitation, this work introduces a novel multi-task learning method driven by knowledge distillation (MTL-KD), which enables efficient training of heavy decoder models with strong generalization ability. The proposed MTL-KD method transfers policy knowledge from multiple distinct RL-based single-task models to a single heavy decoder model, facilitating label-free training and effectively improving the model's generalization ability across diverse tasks. In addition, we introduce a flexible inference strategy termed Random Reordering Re-Construction (R3C), which is specifically adapted for diverse VRP tasks and further boosts the performance of the multi-task model. Experimental results on 6 seen and 10 unseen VRP variants with up to 1,000 nodes indicate that our proposed method consistently achieves superior performance on both uniform and real-world benchmarks, demonstrating robust generalization abilities. The code is available at [https://github.com/CIAM-Group/MTLKD](https://github.com/CIAM-Group/MTLKD).
Cite
Text
Zheng et al. "MTL-KD: Multi-Task Learning via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver." Advances in Neural Information Processing Systems, 2025.Markdown
[Zheng et al. "MTL-KD: Multi-Task Learning via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/zheng2025neurips-mtlkd/)BibTeX
@inproceedings{zheng2025neurips-mtlkd,
title = {{MTL-KD: Multi-Task Learning via Knowledge Distillation for Generalizable Neural Vehicle Routing Solver}},
author = {Zheng, Yuepeng and Luo, Fu and Wang, Zhenkun and Wu, Yaoxin and Zhou, Yu},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/zheng2025neurips-mtlkd/}
}