Minimax Regret for Partial Monitoring: Infinite Outcomes and Rustichini’s Regret

Abstract

We show that a version of the generalised information ratio of Lattimore and Gyorgy (2020) determines the asymptotic minimax regret for all finite-action partial monitoring games provided that (a) the standard definition of regret is used but the latent space where the adversary plays is potentially infinite; or (b) the regret introduced by Rustichini (1999) is used and the latent space is finite. Our results are complemented by a number of examples. For any p ∈ [1/2, 1] there exists an infinite partial monitoring game for which the minimax regret over n rounds is n^p up to subpolynomial factors and there exist finite games for which the minimax Rustichini regret is n^(4/7) up to subpolynomial factors.

Cite

Text

Lattimore. "Minimax Regret for Partial Monitoring: Infinite Outcomes and Rustichini’s Regret." Conference on Learning Theory, 2022.

Markdown

[Lattimore. "Minimax Regret for Partial Monitoring: Infinite Outcomes and Rustichini’s Regret." Conference on Learning Theory, 2022.](https://mlanthology.org/colt/2022/lattimore2022colt-minimax/)

BibTeX

@inproceedings{lattimore2022colt-minimax,
  title     = {{Minimax Regret for Partial Monitoring: Infinite Outcomes and Rustichini’s Regret}},
  author    = {Lattimore, Tor},
  booktitle = {Conference on Learning Theory},
  year      = {2022},
  pages     = {1547-1575},
  volume    = {178},
  url       = {https://mlanthology.org/colt/2022/lattimore2022colt-minimax/}
}