A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning

Abstract

Infinitely repeated games support equilibrium concepts beyond those present in one-shot games (e.g., cooperation in the prisoner's dilemma). Nonetheless, repeated games fail to capture our real-world intuition for settings with many anonymous agents interacting in pairs. Repeated games with restarts, introduced by Berker and Conitzer, address this concern by giving players the option to restart the game with someone new whenever their partner deviates from an agreed-upon sequence of actions. In their work, they studied symmetric games with symmetric strategies. We significantly extend these results, introducing and analyzing more general notions of equilibria in asymmetric games with restarts. We characterize which goal strategies players can be incentivized to play in equilibrium, and we consider the computational problem of finding such sequences of actions with minimal cost for the agents. We show that this problem is NP-hard in general. However, when the goal sequence maximizes social welfare, we give a pseudo-polynomial time algorithm.

Cite

Text

Daoudi et al. "A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2024. doi:10.24963/ijcai.2024/430

Markdown

[Daoudi et al. "A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/daoudi2024ijcai-conservative/) doi:10.24963/ijcai.2024/430

BibTeX

@inproceedings{daoudi2024ijcai-conservative,
  title     = {{A Conservative Approach for Few-Shot Transfer in Off-Dynamics Reinforcement Learning}},
  author    = {Daoudi, Paul and Prieur, Christophe and Robu, Bogdan and Barlier, Merwan and Dos Santos, Ludovic},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {3890-3898},
  doi       = {10.24963/ijcai.2024/430},
  url       = {https://mlanthology.org/ijcai/2024/daoudi2024ijcai-conservative/}
}