Adversarial Online Learning with Temporal Feedback Graphs
Abstract
We study a variant of prediction with expert advice where the learner’s action at round $t$ is only allowed to depend on losses on a specific subset of the rounds (where the structure of which rounds’ losses are visible at time $t$ is provided by a directed “feedback graph” known to the learner). We present a novel learning algorithm for this setting based on a strategy of partitioning the losses across sub-cliques of this graph. We complement this with a lower bound that is tight in many practical settings, and which we conjecture to be within a constant factor of optimal. For the important class of transitive feedback graphs, we prove that this algorithm is efficiently implementable and obtains the optimal regret bound (up to a universal constant).
Cite
Text
Gatmiry and Schneider. "Adversarial Online Learning with Temporal Feedback Graphs." Conference on Learning Theory, 2024.Markdown
[Gatmiry and Schneider. "Adversarial Online Learning with Temporal Feedback Graphs." Conference on Learning Theory, 2024.](https://mlanthology.org/colt/2024/gatmiry2024colt-adversarial/)BibTeX
@inproceedings{gatmiry2024colt-adversarial,
title = {{Adversarial Online Learning with Temporal Feedback Graphs}},
author = {Gatmiry, Khashayar and Schneider, Jon},
booktitle = {Conference on Learning Theory},
year = {2024},
pages = {4548-4572},
volume = {247},
url = {https://mlanthology.org/colt/2024/gatmiry2024colt-adversarial/}
}