Single-Loop Federated Actor-Critic Across Heterogeneous Environments

AAAI 2025 pp. 23054-23062

doi:10.1609/AAAI.V39I21.34469 /aaai/2025/zhu2025aaai-single/

Abstract

Federated reinforcement learning (FRL) has emerged as a promising paradigm, enabling multiple agents to collaborate and learn a shared policy adaptable across heterogeneous environments. Among the various reinforcement learning (RL) algorithms, the actor-critic (AC) algorithm stands out for its low variance and high sample efficiency. However, little to nothing is known theoretically about AC in a federated manner, especially each agent interacts with a potentially different environment. The lack of such results is attributed to various technical challenges: a two-level structure illustrating the coupling effect between the actor and the critic, heterogeneous environments, Markovian sampling and multiple local updates. In response, we study Single-Loop Federated Actor Critic (SFAC) where agents perform AC learning in a two-level federated manner while interacting with heterogeneous environments. We then provide bounds on the convergence error of SFAC. The results show that the convergence error asymptotically converges to a near-stationary point, with the extent proportional to environment heterogeneity. Moreover, the sample complexity exhibits a linear speed-up through the federation of agents. We evaluate the performance of SFAC through numerical experiments using common RL benchmarks, which demonstrate its effectiveness.

PDF AAAI Semantic Scholar

Cite

Text

Zhu and Gong. "Single-Loop Federated Actor-Critic Across Heterogeneous Environments." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I21.34469

Markdown

[Zhu and Gong. "Single-Loop Federated Actor-Critic Across Heterogeneous Environments." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/zhu2025aaai-single/) doi:10.1609/AAAI.V39I21.34469

BibTeX

@inproceedings{zhu2025aaai-single,
  title     = {{Single-Loop Federated Actor-Critic Across Heterogeneous Environments}},
  author    = {Zhu, Ye and Gong, Xiaowen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {23054-23062},
  doi       = {10.1609/AAAI.V39I21.34469},
  url       = {https://mlanthology.org/aaai/2025/zhu2025aaai-single/}
}