Potential-Based Algorithms in On-Line Prediction and Game Theory

Abstract

In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire's Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret andshow its applications in learning theory.

Cite

Text

Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in On-Line Prediction and Game Theory." Machine Learning, 2003. doi:10.1023/A:1022901500417

Markdown

[Cesa-Bianchi and Lugosi. "Potential-Based Algorithms in On-Line Prediction and Game Theory." Machine Learning, 2003.](https://mlanthology.org/mlj/2003/cesabianchi2003mlj-potentialbased/) doi:10.1023/A:1022901500417

BibTeX

@article{cesabianchi2003mlj-potentialbased,
  title     = {{Potential-Based Algorithms in On-Line Prediction and Game Theory}},
  author    = {Cesa-Bianchi, Nicolò and Lugosi, Gábor},
  journal   = {Machine Learning},
  year      = {2003},
  pages     = {239-261},
  doi       = {10.1023/A:1022901500417},
  volume    = {51},
  url       = {https://mlanthology.org/mlj/2003/cesabianchi2003mlj-potentialbased/}
}