Finite-Time Analysis of Actor-Critic Methods with Deep Neural Network Approximation

Abstract

Actor–critic (AC) algorithms underpin many of today’s most successful reinforcement learning (RL) applications, yet their finite-time convergence in realistic settings remains largely underexplored. Existing analyses often rely on oversimplified formulations and are largely confined to linear function approximation. In practice, however, nonlinear approximations with deep neural networks dominate AC implementations, leaving a substantial gap between theory and practice. In this work, we provide the first finite-time analysis of single-timescale AC with deep neural network approximation in continuous state-action spaces. In particular, we consider the challenging time-average reward setting, where one needs to simultaneously control three highly-coupled error terms including the reward error, the critic error, and the actor error. Our novel analysis is able to establish convergence to a stationary point at a rate $\widetilde{\mathcal{O}}(T^{-1/2})$, where $T$ denotes the total number of iterations, thereby providing theoretical grounding for widely used deep AC methods. We substantiate these theoretical guarantees with experiments that confirm the proven convergence rate and further demonstrate strong performance on MuJoCo benchmarks.

Cite

Text

Chen et al. "Finite-Time Analysis of Actor-Critic Methods with Deep Neural Network Approximation." International Conference on Learning Representations, 2026.

Markdown

[Chen et al. "Finite-Time Analysis of Actor-Critic Methods with Deep Neural Network Approximation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/chen2026iclr-finitetime/)

BibTeX

@inproceedings{chen2026iclr-finitetime,
  title     = {{Finite-Time Analysis of Actor-Critic Methods with Deep Neural Network Approximation}},
  author    = {Chen, Xuyang and Zhang, Fengzhuo and Yan, Keyu and Zhao, Lin},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/chen2026iclr-finitetime/}
}