Online Learning with Dependent Stochastic Feedback Graphs
Abstract
A general framework for online learning with partial information is one where feedback graphs specify which losses can be observed by the learner. We study a challenging scenario where feedback graphs vary stochastically with time and, more importantly, where graphs and losses are dependent. This scenario appears in several real-world applications that we describe where the outcome of actions are correlated. We devise a new algorithm for this setting that exploits the stochastic properties of the graphs and that benefits from favorable regret guarantees. We present a detailed theoretical analysis of this algorithm, and also report the result of a series of experiments on real-world datasets, which show that our algorithm outperforms standard baselines for online learning with feedback graphs.
Cite
Text
Cortes et al. "Online Learning with Dependent Stochastic Feedback Graphs." International Conference on Machine Learning, 2020.Markdown
[Cortes et al. "Online Learning with Dependent Stochastic Feedback Graphs." International Conference on Machine Learning, 2020.](https://mlanthology.org/icml/2020/cortes2020icml-online/)BibTeX
@inproceedings{cortes2020icml-online,
title = {{Online Learning with Dependent Stochastic Feedback Graphs}},
author = {Cortes, Corinna and Desalvo, Giulia and Gentile, Claudio and Mohri, Mehryar and Zhang, Ningshan},
booktitle = {International Conference on Machine Learning},
year = {2020},
pages = {2154-2163},
volume = {119},
url = {https://mlanthology.org/icml/2020/cortes2020icml-online/}
}