Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents
Abstract
We propose a novel framework for modelling strategic interactions between boundedly-rational agents in complex, partially observable environments. Our approach introduces agents that minimize a free-energy functional, capturing the divergence between their beliefs about future trajectories and their preferences, which are represented by a biased probabilistic model. We extend this to multi-agent settings and introduce Free-Energy Equilibria, a new class of game-theoretic solution concepts. We begin by establishing the relationship between Free-Energy Equilibria and existing game-theoretic solution concepts. Then, we propose an approach to studying cooperation by contrasting Free-Energy Equilibria with joint free-energy minimization, extending key concepts from mechanism design. Our framework allows for modelling interactions between agents with varying levels of rationality and biased or incorrect world models, providing insights into human-AI interaction and AI alignment.
Cite
Text
Hyland et al. "Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents." ICML 2024 Workshops: MFHAIA, 2024.Markdown
[Hyland et al. "Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents." ICML 2024 Workshops: MFHAIA, 2024.](https://mlanthology.org/icmlw/2024/hyland2024icmlw-freeenergy/)BibTeX
@inproceedings{hyland2024icmlw-freeenergy,
title = {{Free-Energy Equilibria: Toward a Theory of Interactions Between Boundedly-Rational Agents}},
author = {Hyland, David and Gavenčiak, Tomáš and Da Costa, Lancelot and Heins, Conor and Kovarik, Vojtech and Gutierrez, Julian and Wooldridge, Michael J. and Kulveit, Jan},
booktitle = {ICML 2024 Workshops: MFHAIA},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/hyland2024icmlw-freeenergy/}
}