A Bad Arm Existence Checking Problem: How to Utilize Asymmetric Problem Structure?
Abstract
We study a bad arm existence checking problem in a stochastic K-armed bandit setting, in which a player’s task is to judge whether a positive arm exists or all the arms are negative among given K arms by drawing as small number of arms as possible. Here, an arm is positive if its expected loss suffered by drawing the arm is at least a given threshold θU\documentclass[12pt]minimal \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}-69pt \begin{document}$\theta _U$\end{document}, and it is negative if that is less than another given threshold θL(≤θU)\documentclass[12pt]minimal \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}-69pt \begin{document}$\theta _L(\le \theta _U)$\end{document}. This problem is a formalization of diagnosis of disease or machine failure. An interesting structure of this problem is the asymmetry of positive and negative arms’ roles; finding one positive arm is enough to judge positive existence while all the arms must be discriminated as negative to judge whole negativity. In the case with Δ=θU-θL>0\documentclass[12pt]minimal \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}-69pt \begin{document}$\varDelta =\theta _U-\theta _L>0$\end{document}, we propose elimination algorithms with arm selection policy (policy to determine the next arm to draw) and decision condition (condition to conclude positive arm’s existence or the drawn arm’s negativity) utilizing this asymmetric problem structure and prove its effectiveness theoretically and empirically.
Cite
Text
Tabata et al. "A Bad Arm Existence Checking Problem: How to Utilize Asymmetric Problem Structure?." Machine Learning, 2020. doi:10.1007/S10994-019-05854-7Markdown
[Tabata et al. "A Bad Arm Existence Checking Problem: How to Utilize Asymmetric Problem Structure?." Machine Learning, 2020.](https://mlanthology.org/mlj/2020/tabata2020mlj-bad/) doi:10.1007/S10994-019-05854-7BibTeX
@article{tabata2020mlj-bad,
title = {{A Bad Arm Existence Checking Problem: How to Utilize Asymmetric Problem Structure?}},
author = {Tabata, Koji and Nakamura, Atsuyoshi and Honda, Junya and Komatsuzaki, Tamiki},
journal = {Machine Learning},
year = {2020},
pages = {327-372},
doi = {10.1007/S10994-019-05854-7},
volume = {109},
url = {https://mlanthology.org/mlj/2020/tabata2020mlj-bad/}
}