Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems
Abstract
Solving large-scale vehicle routing problems (VRPs) is NP-hard and poses a computational challenge in numerous applications such as logistics. Meanwhile, mean-field control (MFC) provides a tractable and rigorous approach to controlling many agents. We provide a solution to pickup-and-delivery VRPs via scalable MFC. In combination with reinforcement learning (RL) and clustering, our MFC approach efficiently scales to large-scale VRPs. We perform a theoretical analysis of our MFC-based approximation, giving convergence results for large VRP instances and error bounds for clustering-based approximations. We verify our algorithms on different datasets and compare them against solutions such as OR-Tools, PyVRP and heuristics, showing scalability in terms of speed for mean-field methods, for the first time in discrete optimization. Overall, our work establishes a novel synthesis of MFC-based RL techniques, vehicle routing problems and clustering approximations, to solve a hard discrete optimization problem of practical use in a scalable way.
Cite
Text
Cui et al. "Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems." Transactions on Machine Learning Research, 2025.Markdown
[Cui et al. "Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems." Transactions on Machine Learning Research, 2025.](https://mlanthology.org/tmlr/2025/cui2025tmlr-meanfield/)BibTeX
@article{cui2025tmlr-meanfield,
title = {{Mean-Field RL for Large-Scale Unit-Capacity Pickup-and-Delivery Problems}},
author = {Cui, Kai and Azem, Sharif and Fabian, Christian and Kuroptev, Kirill and Khalili, Ramin and Abboud, Osama and Steinke, Florian and Koeppl, Heinz},
journal = {Transactions on Machine Learning Research},
year = {2025},
url = {https://mlanthology.org/tmlr/2025/cui2025tmlr-meanfield/}
}