Following the Perturbed Leader for Online Structured Learning

Abstract

We investigate a new Follow the Perturbed Leader (FTPL) algorithm for online structured prediction problems. We show a regret bound which is comparable to the state of the art of FTPL algorithms and is comparable with the best possible regret in some cases. To better understand FTPL algorithms for online structured learning, we present a lower bound on the regret for a large and natural class of FTPL algorithms that use logconcave perturbations. We complete our investigation with an online shortest path experiment and empirically show that our algorithm is both statistically and computationally efficient.

Cite

Text

Cohen and Hazan. "Following the Perturbed Leader for Online Structured Learning." International Conference on Machine Learning, 2015.

Markdown

[Cohen and Hazan. "Following the Perturbed Leader for Online Structured Learning." International Conference on Machine Learning, 2015.](https://mlanthology.org/icml/2015/cohen2015icml-following/)

BibTeX

@inproceedings{cohen2015icml-following,
  title     = {{Following the Perturbed Leader for Online Structured Learning}},
  author    = {Cohen, Alon and Hazan, Tamir},
  booktitle = {International Conference on Machine Learning},
  year      = {2015},
  pages     = {1034-1042},
  volume    = {37},
  url       = {https://mlanthology.org/icml/2015/cohen2015icml-following/}
}