Adversarial Online Learning with Temporal Feedback Graphs

Abstract

We study a variant of prediction with expert advice where the learner’s action at round $t$ is only allowed to depend on losses on a specific subset of the rounds (where the structure of which rounds’ losses are visible at time $t$ is provided by a directed “feedback graph” known to the learner). We present a novel learning algorithm for this setting based on a strategy of partitioning the losses across sub-cliques of this graph. We complement this with a lower bound that is tight in many practical settings, and which we conjecture to be within a constant factor of optimal. For the important class of transitive feedback graphs, we prove that this algorithm is efficiently implementable and obtains the optimal regret bound (up to a universal constant).

Cite

Text

Gatmiry and Schneider. "Adversarial Online Learning with Temporal Feedback Graphs." Conference on Learning Theory, 2024.

Markdown

[Gatmiry and Schneider. "Adversarial Online Learning with Temporal Feedback Graphs." Conference on Learning Theory, 2024.](https://mlanthology.org/colt/2024/gatmiry2024colt-adversarial/)

BibTeX

@inproceedings{gatmiry2024colt-adversarial,
  title     = {{Adversarial Online Learning with Temporal Feedback Graphs}},
  author    = {Gatmiry, Khashayar and Schneider, Jon},
  booktitle = {Conference on Learning Theory},
  year      = {2024},
  pages     = {4548-4572},
  volume    = {247},
  url       = {https://mlanthology.org/colt/2024/gatmiry2024colt-adversarial/}
}