A Policy Optimization Approach to the Solution of Unregularized Mean Field Games
Abstract
We study the problem of finding the equilibrium of a mean field game (MFG) -- a policy performing optimally in a Markov decision process (MDP) determined by the mean field, which is a distribution over a population of agents and a function of the policy. Prior solution techniques build upon fixed-point iteration and are only guaranteed to solve a regularized approximation of the problem, with a regularization constant large enough to ensure that the equilibrium is the unique fixed point of a contraction mapping. This leads to a regularized solution that can deviate arbitrarily from the original equilibrium. In this work, for the first time, we demonstrate how direct gradient-based policy optimization instead of fixed-point iteration, may solve the original, unregularized infinite-horizon average-reward MFG. In particular, we propose Accelerated Single-loop Actor Critic Algorithm for Mean Field Games (ASAC-MFG), which by its namesake, is completely data-driven, single-loop, and single-sample-path. We characterize the finite-time and finite-sample convergence of the ASAC-MFG algorithm to a mean field equilibrium building on a novel multi-time-scale analysis without regularization. We support the theoretical results with numerical simulations that illustrate the superior convergence of the proposed algorithm.
Cite
Text
Zeng et al. "A Policy Optimization Approach to the Solution of Unregularized Mean Field Games." ICML 2024 Workshops: RLControlTheory, 2024.Markdown
[Zeng et al. "A Policy Optimization Approach to the Solution of Unregularized Mean Field Games." ICML 2024 Workshops: RLControlTheory, 2024.](https://mlanthology.org/icmlw/2024/zeng2024icmlw-policy/)BibTeX
@inproceedings{zeng2024icmlw-policy,
title = {{A Policy Optimization Approach to the Solution of Unregularized Mean Field Games}},
author = {Zeng, Sihan and Bhatt, Sujay and Koppel, Alec and Ganesh, Sumitra},
booktitle = {ICML 2024 Workshops: RLControlTheory},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/zeng2024icmlw-policy/}
}