Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

COLT 2014 pp. 1020-1039

/colt/2014/mcmahan2014colt-unconstrained/

Abstract

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving, several previous results as immediate corollaries. Moreover, using our tools, we develop an algorithm that provides a regret bound ofO U q T log(U p T log 2 T + 1) , where U is the L2 norm of an arbitrary comparator and both T and U are unknown to the player. This bound is optimal up to p log logT terms. When T is known, we derive an algorithm with an optimal regret bound (up to constant factors). For both the known and unknown T case, a Normal approximation to the conditional value of the game proves to be the key analysis tool.

PDF COLT Semantic Scholar

Cite

Text

McMahan and Orabona. "Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations." Annual Conference on Computational Learning Theory, 2014.

Markdown

[McMahan and Orabona. "Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations." Annual Conference on Computational Learning Theory, 2014.](https://mlanthology.org/colt/2014/mcmahan2014colt-unconstrained/)

BibTeX

@inproceedings{mcmahan2014colt-unconstrained,
  title     = {{Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations}},
  author    = {McMahan, H. Brendan and Orabona, Francesco},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {2014},
  pages     = {1020-1039},
  url       = {https://mlanthology.org/colt/2014/mcmahan2014colt-unconstrained/}
}