Efficient Learning in Polyhedral Games via Best Response Oracles

Abstract

We study online learning and equilibrium computation in games with polyhedral decision sets with only first-order oracle and best-response oracle access. Our approach achieves constant regret in zero-sum games and $O(T^{1/4})$ in general-sum games while using only $O(\log t)$ best-response queries at a given iteration $t$. This convergence occurs at a linear rate, though with a condition-number dependence. Our algorithm also achieves best-iterate convergence at a rate of $O(1/\sqrt{T})$ without such a dependence. Our algorithm uses a linearly convergent variant of Frank-Wolfe (FW) whose linear convergence depends on a condition number of the polytope known as the facial distance. We show two broad new results, characterizing the facial distance when the polyhedral sets satisfy a certain structure.

Cite

Text

Chakrabarti et al. "Efficient Learning in Polyhedral Games via Best Response Oracles." NeurIPS 2023 Workshops: OPT, 2023.

Markdown

[Chakrabarti et al. "Efficient Learning in Polyhedral Games via Best Response Oracles." NeurIPS 2023 Workshops: OPT, 2023.](https://mlanthology.org/neuripsw/2023/chakrabarti2023neuripsw-efficient/)

BibTeX

@inproceedings{chakrabarti2023neuripsw-efficient,
  title     = {{Efficient Learning in Polyhedral Games via Best Response Oracles}},
  author    = {Chakrabarti, Darshan and Farina, Gabriele and Kroer, Christian},
  booktitle = {NeurIPS 2023 Workshops: OPT},
  year      = {2023},
  url       = {https://mlanthology.org/neuripsw/2023/chakrabarti2023neuripsw-efficient/}
}