Practical Contextual Bandits with Feedback Graphs

Mengxiao Zhang, Yuheng Zhang, Olga Vrousgou, Haipeng Luo, Paul Mineiro

NeurIPS 2023

/neurips/2023/zhang2023neurips-practical/

Abstract

While contextual bandit has a mature theory, effectively leveraging different feedback patterns to enhance the pace of learning remains unclear. Bandits with feedback graphs, which interpolates between the full information and bandit regimes, provides a promising framework to mitigate the statistical complexity of learning. In this paper, we propose and analyze an approach to contextual bandits with feedback graphs based upon reduction to regression. The resulting algorithms are computationally practical and achieve established minimax rates, thereby reducing the statistical complexity in real-world applications.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Zhang et al. "Practical Contextual Bandits with Feedback Graphs." Neural Information Processing Systems, 2023.

Markdown

[Zhang et al. "Practical Contextual Bandits with Feedback Graphs." Neural Information Processing Systems, 2023.](https://mlanthology.org/neurips/2023/zhang2023neurips-practical/)

BibTeX

@inproceedings{zhang2023neurips-practical,
  title     = {{Practical Contextual Bandits with Feedback Graphs}},
  author    = {Zhang, Mengxiao and Zhang, Yuheng and Vrousgou, Olga and Luo, Haipeng and Mineiro, Paul},
  booktitle = {Neural Information Processing Systems},
  year      = {2023},
  url       = {https://mlanthology.org/neurips/2023/zhang2023neurips-practical/}
}