Online Learning with Dependent Stochastic Feedback Graphs

Cortes, Corinna; Desalvo, Giulia; Gentile, Claudio; Mohri, Mehryar; Zhang, Ningshan

Online Learning with Dependent Stochastic Feedback Graphs

Corinna Cortes, Giulia Desalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang

ICML 2020 pp. 2154-2163

/icml/2020/cortes2020icml-online/

Abstract

A general framework for online learning with partial information is one where feedback graphs specify which losses can be observed by the learner. We study a challenging scenario where feedback graphs vary stochastically with time and, more importantly, where graphs and losses are dependent. This scenario appears in several real-world applications that we describe where the outcome of actions are correlated. We devise a new algorithm for this setting that exploits the stochastic properties of the graphs and that benefits from favorable regret guarantees. We present a detailed theoretical analysis of this algorithm, and also report the result of a series of experiments on real-world datasets, which show that our algorithm outperforms standard baselines for online learning with feedback graphs.

PDF ICML Semantic Scholar

Cite

Text

Cortes et al. "Online Learning with Dependent Stochastic Feedback Graphs." International Conference on Machine Learning, 2020.

Markdown

[Cortes et al. "Online Learning with Dependent Stochastic Feedback Graphs." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/cortes2020icml-online/)

BibTeX

@inproceedings{cortes2020icml-online,
  title     = {{Online Learning with Dependent Stochastic Feedback Graphs}},
  author    = {Cortes, Corinna and Desalvo, Giulia and Gentile, Claudio and Mohri, Mehryar and Zhang, Ningshan},
  booktitle = {International Conference on Machine Learning},
  year      = {2020},
  pages     = {2154-2163},
  volume    = {119},
  url       = {https://mlanthology.org/icml/2020/cortes2020icml-online/}
}