Single-Loop Federated Actor-Critic Across Heterogeneous Environments

Abstract

Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments. Among the various reinforcement learning (RL) algorithms, the actor-critic (AC) algorithm stands out for its low variance and high sample efficiency. However, little to nothing is known theoretically about AC in a federated manner, especially each agent interacts with a potentially different environment. The lack of such results is attributed to various technical challenges: a two-level structure illustrating the coupling effect between the actor and the critic, heterogeneous environments, Markovian sampling and multiple local updates. In response, we study Single-Loop Federated Actor Critic (SFAC) where agents perform AC learning in a two-level federated manner while interacting with heterogeneous environments. We then provide bounds on the convergence error of SFAC. The results show that the convergence error asymptotically converges to a near-stationary point, with the extent proportional to environment heterogeneity. Moreover, the sample complexity exhibits a linear speed-up through the federation of agents. We evaluate the performance of SFAC through numerical experiments using common RL benchmarks, which demonstrate its effectiveness.

Cite

Text

Zhu and Gong. "Single-Loop Federated Actor-Critic Across Heterogeneous Environments." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I21.34469

Markdown

[Zhu and Gong. "Single-Loop Federated Actor-Critic Across Heterogeneous Environments." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhu2025aaai-single/) doi:10.1609/AAAI.V39I21.34469

BibTeX

@inproceedings{zhu2025aaai-single,
  title     = {{Single-Loop Federated Actor-Critic Across Heterogeneous Environments}},
  author    = {Zhu, Ye and Gong, Xiaowen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23054-23062},
  doi       = {10.1609/AAAI.V39I21.34469},
  url       = {https://mlanthology.org/aaai/2025/zhu2025aaai-single/}
}