Potential-Based Algorithms in On-Line Prediction and Game Theory
Abstract
In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire's Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret andshow its applications in learning theory.
Cite
Text
Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in On-Line Prediction and Game Theory." Machine Learning, 2003. doi:10.1023/A:1022901500417Markdown
[Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in On-Line Prediction and Game Theory." Machine Learning, 2003.](https://mlanthology.org/mlj/2003/cesabianchi2003mlj-potentialbased/) doi:10.1023/A:1022901500417BibTeX
@article{cesabianchi2003mlj-potentialbased,
title = {{Potential-Based Algorithms in On-Line Prediction and Game Theory}},
author = {Cesa-Bianchi, Nicolò and Lugosi, Gábor},
journal = {Machine Learning},
year = {2003},
pages = {239-261},
doi = {10.1023/A:1022901500417},
volume = {51},
url = {https://mlanthology.org/mlj/2003/cesabianchi2003mlj-potentialbased/}
}