Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring

Abstract

In this paper the sequential prediction problem with expert advice is considered when the loss is unbounded under partial monitoring scenarios. We deal with a wide class of the partial monitoring problems: the combination of the label efficient and multi-armed bandit problem, that is, where the algorithm is only informed about the performance of the chosen expert with probability ε ≤1. For bounded losses an algorithm is given whose expected regret scales with the square root of the loss of the best expert. For unbounded losses we prove that Hannan consistency can be achieved, depending on the growth rate of the average squared losses of the experts.

Cite

Text

Allenberg et al. "Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring." International Conference on Algorithmic Learning Theory, 2006. doi:10.1007/11894841_20

Markdown

[Allenberg et al. "Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring." International Conference on Algorithmic Learning Theory, 2006.](https://mlanthology.org/alt/2006/allenberg2006alt-hannan/) doi:10.1007/11894841_20

BibTeX

@inproceedings{allenberg2006alt-hannan,
  title     = {{Hannan Consistency in On-Line Learning in Case of Unbounded Losses Under Partial Monitoring}},
  author    = {Allenberg, Chamy and Auer, Peter and Györfi, László and Ottucsák, György},
  booktitle = {International Conference on Algorithmic Learning Theory},
  year      = {2006},
  pages     = {229-243},
  doi       = {10.1007/11894841_20},
  url       = {https://mlanthology.org/alt/2006/allenberg2006alt-hannan/}
}