Neural Policy Gradient Methods: Global Optimality and Rates of Convergence

Cite

Text

Wang et al. "Neural Policy Gradient Methods: Global Optimality and Rates of Convergence." International Conference on Learning Representations, 2020.

Markdown

[Wang et al. "Neural Policy Gradient Methods: Global Optimality and Rates of Convergence." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/wang2020iclr-neural/)

BibTeX

@inproceedings{wang2020iclr-neural,
  title     = {{Neural Policy Gradient Methods: Global Optimality and Rates of Convergence}},
  author    = {Wang, Lingxiao and Cai, Qi and Yang, Zhuoran and Wang, Zhaoran},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/wang2020iclr-neural/}
}