Scalable Spectral Representations for Multiagent Reinforcement Learning in Network MDPs

Abstract

Network Markov Decision Processes (MDPs), which are the de-facto model for multi-agent control, pose a significant challenge to efficient learning caused by the exponential growth of the global state-action space with the number of agents. In this work, utilizing the exponential decay property of network dynamics, we first derive scalable spectral local representations for multiagent reinforcement learning in network MDPs, which induces a network linear subspace for the local $Q$-function of each agent. Building on these local spectral representations, we design a scalable algorithmic framework for multiagent reinforcement learning in continuous state-action network MDPs, and provide end-to-end guarantees for the convergence of our algorithm. Empirically, we validate the effectiveness of our scalable representation-based approach on two benchmark problems, and demonstrate the advantages of our approach over generic function approximation approaches to representing the local $Q$-functions.

Cite

Text

Ren et al. "Scalable Spectral Representations for Multiagent Reinforcement Learning in Network MDPs." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.

Markdown

[Ren et al. "Scalable Spectral Representations for Multiagent Reinforcement Learning in Network MDPs." Proceedings of The 28th International Conference on Artificial Intelligence and Statistics, 2025.](https://mlanthology.org/aistats/2025/ren2025aistats-scalable/)

BibTeX

@inproceedings{ren2025aistats-scalable,
  title     = {{Scalable Spectral Representations for Multiagent Reinforcement Learning in Network MDPs}},
  author    = {Ren, Zhaolin and Zhang, Runyu and Dai, Bo and Li, Na},
  booktitle = {Proceedings of The 28th International Conference on Artificial Intelligence and Statistics},
  year      = {2025},
  pages     = {550-558},
  volume    = {258},
  url       = {https://mlanthology.org/aistats/2025/ren2025aistats-scalable/}
}