Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective

Chen, Xiaosong; Zhuang, Hanqin; Liu, Yang; Xu, Huanle; Lau, Wing Cheong

doi:10.1613/JAIR.1.16580

Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective

Xiaosong Chen, Hanqin Zhuang, Yang Liu, Huanle Xu, Wing Cheong Lau

JAIR 2025 pp. 1909-1942

doi:10.1613/JAIR.1.16580 /jair/2025/chen2025jair-combinatorial/

Abstract

The problem of multi-armed bandit (MAB) with fairness constraints has emerged as an important research topic recently. For such problems, one common objective is to maximize the total rewards within a fixed number of pull rounds, while satisfying the fairness requirement of a minimum selection fraction for each individual arm in the long run. Previous works have made substantial advancements in designing various online selection solutions for MAB, however, when incorporating such fairness constraints, they fail to achieve a sublinear regret bound. In this paper, we study a combinatorial MAB problem with concave objective and fairness constraints. In particular, we design a new selection algorithm that solves MAB problems from an online convex optimization perspective. Our algorithm is computationally efficient, and more importantly, manages to achieve a sublinear regret bound of O( √ T ln T) with high probability guarantees in T selection rounds. We also extend our framework to include more general knapsack constraints. Finally, we assess the performance of our algorithm through extensive simulations and real dataset applications, demonstrating its significant advantages over baseline schemes.

PDF JAIR Semantic Scholar

Cite

Text

Chen et al. "Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective." Journal of Artificial Intelligence Research, 2025. doi:10.1613/JAIR.1.16580

Markdown

[Chen et al. "Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective." Journal of Artificial Intelligence Research, 2025.](https://mlanthology.org/jair/2025/chen2025jair-combinatorial/) doi:10.1613/JAIR.1.16580

BibTeX

@article{chen2025jair-combinatorial,
  title     = {{Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective}},
  author    = {Chen, Xiaosong and Zhuang, Hanqin and Liu, Yang and Xu, Huanle and Lau, Wing Cheong},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2025},
  pages     = {1909-1942},
  doi       = {10.1613/JAIR.1.16580},
  volume    = {82},
  url       = {https://mlanthology.org/jair/2025/chen2025jair-combinatorial/}
}