Bandit Pareto Set Identification: The Fixed Budget Setting

Abstract

We study a multi-objective pure exploration problem in a multi-armed bandit model. Each arm is associated to an unknown multi-variate distribution and the goal is to identify the distributions whose mean is not uniformly worse than that of another distribution: the Pareto optimal set. We propose and analyze the first algorithms for the \emph{fixed budget} Pareto Set Identification task. We propose Empirical Gap Elimination, a family of algorithms combining a careful estimation of the “hardness to classify” each arm in or out of the Pareto set with a generic elimination scheme. We prove that two particular instances, EGE-SR and EGE-SH, have a probability of error that decays exponentially fast with the budget, with an exponent supported by an information theoretic lower-bound. We complement these findings with an empirical study using real-world and synthetic datasets, which showcase the good performance of our algorithms.

Cite

Text

Kone et al. "Bandit Pareto Set Identification: The Fixed Budget Setting." Artificial Intelligence and Statistics, 2024.

Markdown

[Kone et al. "Bandit Pareto Set Identification: The Fixed Budget Setting." Artificial Intelligence and Statistics, 2024.](https://mlanthology.org/aistats/2024/kone2024aistats-bandit/)

BibTeX

@inproceedings{kone2024aistats-bandit,
  title     = {{Bandit Pareto Set Identification: The Fixed Budget Setting}},
  author    = {Kone, Cyrille and Kaufmann, Emilie and Richert, Laura},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2024},
  pages     = {2548-2556},
  volume    = {238},
  url       = {https://mlanthology.org/aistats/2024/kone2024aistats-bandit/}
}