Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation

Abstract

Restless multi-armed bandits (RMABs) are an important model to optimize allocation of limited resources in sequential decision-making settings. Typical RMABs assume the budget --- the number of arms pulled --- to be fixed for each step in the planning horizon. However, for realistic real-world planning, resources are not necessarily limited at each planning step; we may be able to distribute surplus resources in one round to an earlier or later round. In real-world planning settings, this flexibility in budget is often constrained to within a subset of consecutive planning steps, e.g., weekly planning of a monthly budget. In this paper we define a general class of RMABs with flexible budget, which we term F-RMABs, and provide an algorithm to optimally solve for them. We derive a min-max formulation to find optimal policies for F-RMABs and leverage gradient primal-dual algorithms to solve for reward-maximizing policies with flexible budgets. We introduce a scheme to sample expected gradients to apply primal-dual algorithms to the F-RMAB setting and make an otherwise computationally expensive approach tractable. Additionally, we provide heuristics that trade off solution quality for efficiency and present experimental comparisons of different F-RMAB solution approaches.

Cite

Text

Diaz et al. "Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I10.26427

Markdown

[Diaz et al. "Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/diaz2023aaai-flexible/) doi:10.1609/AAAI.V37I10.26427

BibTeX

@inproceedings{diaz2023aaai-flexible,
  title     = {{Flexible Budgets in Restless Bandits: A Primal-Dual Algorithm for Efficient Budget Allocation}},
  author    = {Diaz, Paula Rodriguez and Killian, Jackson A. and Xu, Lily and Suggala, Arun Sai and Taneja, Aparna and Tambe, Milind},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {12103-12111},
  doi       = {10.1609/AAAI.V37I10.26427},
  url       = {https://mlanthology.org/aaai/2023/diaz2023aaai-flexible/}
}