Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

Abstract

We revisit the problem of stochastic online learning with feedback graphs, with the goal of devising algorithms that are optimal, up to constants, both asymptotically and in finite time. We show that, surprisingly, the notion of optimal finite-time regret is not a uniquely defined property in this context and that, in general, it is decoupled from the asymptotic rate. We discuss alternative choices and propose a notion of finite-time optimality that we argue is *meaningful*. For that notion, we give an algorithm that admits quasi-optimal regret both in finite time and asymptotically.
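
To make the setting concrete, here is a minimal sketch of the stochastic feedback-graph model the abstract refers to: pulling an arm yields that arm's reward, but also reveals the rewards of its out-neighbors in a feedback graph. The specific graph, reward means, and the simple UCB-style learner below are illustrative assumptions, not the algorithm proposed in the paper.

```python
# Illustrative stochastic bandit with a feedback graph (assumed example, not the paper's algorithm).
import numpy as np

rng = np.random.default_rng(0)

K = 3                                  # number of arms
means = np.array([0.3, 0.5, 0.7])      # hypothetical Bernoulli reward means
# Feedback graph: playing arm i also reveals the reward of arm j whenever G[i, j] = 1.
G = np.array([[1, 1, 0],
              [0, 1, 1],
              [1, 0, 1]])

T = 5000
counts = np.zeros(K)                   # observations per arm (not plays)
sums = np.zeros(K)
reward_total = 0.0

for t in range(1, T + 1):
    # UCB-style index built from all observations revealed through the graph so far.
    ucb = np.where(counts > 0,
                   sums / np.maximum(counts, 1)
                   + np.sqrt(2 * np.log(t) / np.maximum(counts, 1)),
                   np.inf)
    arm = int(np.argmax(ucb))
    rewards = rng.binomial(1, means)   # rewards drawn for every arm this round
    reward_total += rewards[arm]       # the learner earns only the played arm's reward...
    observed = np.flatnonzero(G[arm])  # ...but observes every out-neighbor of the played arm
    counts[observed] += 1
    sums[observed] += rewards[observed]

regret = T * means.max() - reward_total
print(f"empirical regret over T={T} rounds: {regret:.1f}")
```

The extra observations supplied by the graph are what allow regret guarantees finer than in the standard bandit setting; the paper studies how to trade off finite-time and asymptotic optimality in exactly this model.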

Cite

Text

Marinov et al. "Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality." Neural Information Processing Systems, 2022.

Markdown

[Marinov et al. "Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/marinov2022neurips-stochastic/)

BibTeX

@inproceedings{marinov2022neurips-stochastic,
  title     = {{Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality}},
  author    = {Marinov, Teodor Vanislavov and Mohri, Mehryar and Zimmert, Julian},
  booktitle = {Neural Information Processing Systems},
  year      = {2022},
  url       = {https://mlanthology.org/neurips/2022/marinov2022neurips-stochastic/}
}