Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective
Abstract
The problem of multi-armed bandit (MAB) with fairness constraints has emerged as an important research topic recently. For such problems, one common objective is to maximize the total rewards within a fixed number of pull rounds, while satisfying the fairness requirement of a minimum selection fraction for each individual arm in the long run. Previous works have made substantial advancements in designing various online selection solutions for MAB, however, when incorporating such fairness constraints, they fail to achieve a sublinear regret bound. In this paper, we study a combinatorial MAB problem with concave objective and fairness constraints. In particular, we design a new selection algorithm that solves MAB problems from an online convex optimization perspective. Our algorithm is computationally efficient, and more importantly, manages to achieve a sublinear regret bound of O( √ T ln T) with high probability guarantees in T selection rounds. We also extend our framework to include more general knapsack constraints. Finally, we assess the performance of our algorithm through extensive simulations and real dataset applications, demonstrating its significant advantages over baseline schemes.
Cite
Text
Chen et al. "Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective." Journal of Artificial Intelligence Research, 2025. doi:10.1613/JAIR.1.16580Markdown
[Chen et al. "Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective." Journal of Artificial Intelligence Research, 2025.](https://mlanthology.org/jair/2025/chen2025jair-combinatorial/) doi:10.1613/JAIR.1.16580BibTeX
@article{chen2025jair-combinatorial,
title = {{Combinatorial Multi-Armed Bandits with Fairness Constraints: An Online Convex Optimization Perspective}},
author = {Chen, Xiaosong and Zhuang, Hanqin and Liu, Yang and Xu, Huanle and Lau, Wing Cheong},
journal = {Journal of Artificial Intelligence Research},
year = {2025},
pages = {1909-1942},
doi = {10.1613/JAIR.1.16580},
volume = {82},
url = {https://mlanthology.org/jair/2025/chen2025jair-combinatorial/}
}