Potential-Based Algorithms in Online Prediction and Game Theory

Abstract

In this paper we show that several known algorithms for sequential prediction problems (including the quasi-additive family of Grove et al . and Littlestone and Warmuth’s Weighted Majority), for playing iterated games (including Freund and Schapire’s Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret and show its applications in learning theory.

Cite

Text

Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in Online Prediction and Game Theory." Annual Conference on Computational Learning Theory, 2001. doi:10.1007/3-540-44581-1_4

Markdown

[Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in Online Prediction and Game Theory." Annual Conference on Computational Learning Theory, 2001.](https://mlanthology.org/colt/2001/cesabianchi2001colt-potential/) doi:10.1007/3-540-44581-1_4

BibTeX

@inproceedings{cesabianchi2001colt-potential,
  title     = {{Potential-Based Algorithms in Online Prediction and Game Theory}},
  author    = {Cesa-Bianchi, Nicolò and Lugosi, Gábor},
  booktitle = {Annual Conference on Computational Learning Theory},
  year      = {2001},
  pages     = {48-64},
  doi       = {10.1007/3-540-44581-1_4},
  url       = {https://mlanthology.org/colt/2001/cesabianchi2001colt-potential/}
}