On the Convergence of No-Regret Learning in Selfish Routing

Abstract

We study the repeated, non-atomic routing game, in which selfish players make a sequence of routing decisions. We consider a model in which players use regret-minimizing algorithms as the learning mechanism, and study the resulting dynamics. We are concerned in particular with the convergence to the set of Nash equilibria of the routing game. No-regret learning algorithms are known to guarantee convergence of a subsequence of population strategies. We are concerned with convergence of the actual sequence. We show that convergence holds for a large class of online learning algorithms, inspired from the continuous-time replicator dynamics. In particular, the discounted Hedge algorithm is proved to belong to this class, which guarantees its convergence.

Cite

Text

Krichene et al. "On the Convergence of No-Regret Learning in Selfish Routing." International Conference on Machine Learning, 2014.

Markdown

[Krichene et al. "On the Convergence of No-Regret Learning in Selfish Routing." International Conference on Machine Learning, 2014.](https://mlanthology.org/icml/2014/krichene2014icml-convergence/)

BibTeX

@inproceedings{krichene2014icml-convergence,
  title     = {{On the Convergence of No-Regret Learning in Selfish Routing}},
  author    = {Krichene, Walid and Drighès, Benjamin and Bayen, Alexandre},
  booktitle = {International Conference on Machine Learning},
  year      = {2014},
  pages     = {163-171},
  volume    = {32},
  url       = {https://mlanthology.org/icml/2014/krichene2014icml-convergence/}
}