Nearly Tight Bounds for Exploration in Streaming Multi-Armed Bandits with Known Optimality Gap
Abstract
We investigate the sample-memory-pass trade-offs for pure exploration in multi-pass streaming multi-armed bandits (MABs) with the *a priori* knowledge of the optimality gap ?_[2]. Here, and throughout, the optimality gap ?_[i] is defined as the mean reward gap between the best and the i-th best arms. A recent line of results have shown that if there is no known ?_[2], a pass complexity of ̃?(log(1/?_[2])) is necessary and sufficient to obtain the *worst-case optimal* O(n/?²_[2]) sample complexity with a single-arm memory. However, our understanding of multi-pass algorithms with known ?_[2] is still limited. Here, the key open problem is how many passes are required to achieve the complexity, i.e., O( ∑ᵢ₌₂ⁿ1/?²_[i] log{n}) arm pulls, with a sublinear memory size. In this work, we show that the ``right answer'' for the question is ̃?(log{n}) passes. We first present a lower bound, showing that any algorithm that finds the best arm with slightly sublinear memory -- a memory of o(n/polylog(n)) arms -- and O( ∑ᵢ₌₂ⁿ1/?²_[i] log n) arm pulls has to make ?(log n/loglog n) passes over the stream. We then show a nearly-matching algorithm that assuming the knowledge of ?_[2], finds the best arm with O( ∑ᵢ₌₂ⁿ1/?²_[i] log n) arm pulls and a *single arm* memory.
Cite
Text
Karpov and Wang. "Nearly Tight Bounds for Exploration in Streaming Multi-Armed Bandits with Known Optimality Gap." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I17.33956Markdown
[Karpov and Wang. "Nearly Tight Bounds for Exploration in Streaming Multi-Armed Bandits with Known Optimality Gap." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/karpov2025aaai-nearly/) doi:10.1609/AAAI.V39I17.33956BibTeX
@inproceedings{karpov2025aaai-nearly,
title = {{Nearly Tight Bounds for Exploration in Streaming Multi-Armed Bandits with Known Optimality Gap}},
author = {Karpov, Nikolai and Wang, Chen},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {17788-17796},
doi = {10.1609/AAAI.V39I17.33956},
url = {https://mlanthology.org/aaai/2025/karpov2025aaai-nearly/}
}