Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity
Abstract
This paper revisits multi-play multi-armed bandit with shareable arm capacities problem, which is tailored for resource allocation problems arising from LLM inference serving, edge intelligence, etc. We investigate the capacity-scarce setting, a common dilemma in resource allocation problems. Existing works yield sub-optimal solutions under this setting, as they rely heavily on the assumption of abundant capacities. This paper present a rather complete solution to this setting and it makes three key contributions. We establish a minmax lower bound for the sample complexity of learning the arm capacities and propose an algorithm that exactly matches this lower bound. We derive both instance-independent and instance-dependent regret lower bounds for learning the optimal play assignment. We introduce an efficient exploration algorithm named \\texttt\{PC-CapUL\} for the capacity-scarce setting and \\texttt\{PC-CapUL\} matches the regret lower bounds up to an acceptable constant. \\texttt\{PC-CapUL\} features a novel index for coordinating the exploration of multiple plays. Experiments show significant improvement over existing methods.
Cite
Text
Li et al. "Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity." Proceedings of the 17th Asian Conference on Machine Learning, 2025.Markdown
[Li et al. "Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity." Proceedings of the 17th Asian Conference on Machine Learning, 2025.](https://mlanthology.org/acml/2025/li2025acml-multiplay/)BibTeX
@inproceedings{li2025acml-multiplay,
title = {{Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity}},
author = {Li, Hanyang and Xie, Hong and Lian, Defu},
booktitle = {Proceedings of the 17th Asian Conference on Machine Learning},
year = {2025},
pages = {223-238},
volume = {304},
url = {https://mlanthology.org/acml/2025/li2025acml-multiplay/}
}