Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity

Li, Hanyang; Xie, Hong; Lian, Defu

Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity

ACML 2025 pp. 223-238

/acml/2025/li2025acml-multiplay/

Abstract

This paper revisits multi-play multi-armed bandit with shareable arm capacities problem, which is tailored for resource allocation problems arising from LLM inference serving, edge intelligence, etc. We investigate the capacity-scarce setting, a common dilemma in resource allocation problems. Existing works yield sub-optimal solutions under this setting, as they rely heavily on the assumption of abundant capacities. This paper present a rather complete solution to this setting and it makes three key contributions. We establish a minmax lower bound for the sample complexity of learning the arm capacities and propose an algorithm that exactly matches this lower bound. We derive both instance-independent and instance-dependent regret lower bounds for learning the optimal play assignment. We introduce an efficient exploration algorithm named \\texttt\{PC-CapUL\} for the capacity-scarce setting and \\texttt\{PC-CapUL\} matches the regret lower bounds up to an acceptable constant. \\texttt\{PC-CapUL\} features a novel index for coordinating the exploration of multiple plays. Experiments show significant improvement over existing methods.

PDF ACML Semantic Scholar

Cite

Text

Li et al. "Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity." Proceedings of the 17th Asian Conference on Machine Learning, 2025.

Markdown

[Li et al. "Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity." Proceedings of the 17th Asian Conference on Machine Learning, 2025.](https://mlanthology.org/acml/2025/li2025acml-multiplay/)

BibTeX

@inproceedings{li2025acml-multiplay,
  title     = {{Multi-Play Multi-Armed Bandits with Shareable Arm Capacities Revisited: Settling Scarce Capacity}},
  author    = {Li, Hanyang and Xie, Hong and Lian, Defu},
  booktitle = {Proceedings of the 17th Asian Conference on Machine Learning},
  year      = {2025},
  pages     = {223-238},
  volume    = {304},
  url       = {https://mlanthology.org/acml/2025/li2025acml-multiplay/}
}