Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning

Abstract

Mean-field games (MFG) have become significant tools for solving large-scale multi-agent reinforcement learning problems under symmetry. However, the assumptions of access to a known MFG model (which might not be available for real-world games) and of exact symmetry (real-world scenarios often feature heterogeneity) limit the applicability of MFGs. In this work, we broaden the applicability of MFGs by providing a methodology to extend any finite-player, possibly asymmetric, game to an “induced MFG”. First, we prove that $N$-player dynamic games can be symmetrized and smoothly extended to the infinite-player continuum via Kirszbraun extensions. Next, we define $\alpha,\beta$-symmetric games, a new class of dynamic games that incorporate approximate permutation invariance. We establish explicit approximation bounds for $\alpha,\beta$-symmetric games, demonstrating that the induced mean-field Nash policy is an approximate Nash of the $N$-player game. We analyze TD learning using sample trajectories of the $N$-player game, permitting learning without using an explicit MFG model or oracle. This is used to show a sample complexity of $\widetilde{\mathcal{O}}(\varepsilon^{-6})$ for $N$-agent monotone extendable games to learn an $\varepsilon$-Nash. Evaluations on benchmarks with thousands of agents support our theory of learning under (approximate) symmetry without explicit MFGs.

Cite

Text

Yardim and He. "Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning." Proceedings of the 7th Annual Learning for Dynamics \& Control Conference, 2025.

Markdown

[Yardim and He. "Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning." Proceedings of the 7th Annual Learning for Dynamics \& Control Conference, 2025.](https://mlanthology.org/l4dc/2025/yardim2025l4dc-exploiting/)

BibTeX

@inproceedings{yardim2025l4dc-exploiting,
  title     = {{Exploiting Approximate Symmetry for Efficient Multi-Agent Reinforcement Learning}},
  author    = {Yardim, Batuhan and He, Niao},
  booktitle = {Proceedings of the 7th Annual Learning for Dynamics \& Control Conference},
  year      = {2025},
  pages     = {31-44},
  volume    = {283},
  url       = {https://mlanthology.org/l4dc/2025/yardim2025l4dc-exploiting/}
}