Top Feasible Arm Identification
Abstract
We propose a new variant of the top arm identification problem, \emph{top feasible arm identification}, where there are $K$ arms associated with $D$-dimensional distributions and the goal is to find $m$ arms that maximize some known linear function of their means subject to the constraint that their means belong to a given set $P \subset R^D$. This problem has many applications since in many settings, feedback is multi-dimensional and it is of interest to perform \emph{constrained maximization}. We present problem-dependent lower bounds for top feasible arm identification and upper bounds for several algorithms. Our most broadly applicable algorithm, TF-LUCB-B, has an upper bound that is loose by a factor of $O(D \log(K))$. Many problems of practical interest are two-dimensional and, for these, it is loose by a factor of $O(\log(K))$. Finally, we conduct experiments on synthetic and real-world datasets that demonstrate the effectiveness of our algorithms. Our algorithms are superior both in theory and in practice to a naive two-stage algorithm that first identifies the feasible arms and then applies a best arm identification algorithm to the feasible arms.
Cite
Text
Katz-Samuels and Scott. "Top Feasible Arm Identification." Artificial Intelligence and Statistics, 2019.Markdown
[Katz-Samuels and Scott. "Top Feasible Arm Identification." Artificial Intelligence and Statistics, 2019.](https://mlanthology.org/aistats/2019/katzsamuels2019aistats-top/)BibTeX
@inproceedings{katzsamuels2019aistats-top,
title = {{Top Feasible Arm Identification}},
author = {Katz-Samuels, Julian and Scott, Clayton},
booktitle = {Artificial Intelligence and Statistics},
year = {2019},
pages = {1593-1601},
volume = {89},
url = {https://mlanthology.org/aistats/2019/katzsamuels2019aistats-top/}
}