Policy Gradient in Lipschitz Markov Decision Processes

Cite

Text

Pirotta et al. "Policy Gradient in Lipschitz Markov Decision Processes." Machine Learning, 2015. doi:10.1007/S10994-015-5484-1

Markdown

[Pirotta et al. "Policy Gradient in Lipschitz Markov Decision Processes." Machine Learning, 2015.](https://mlanthology.org/mlj/2015/pirotta2015mlj-policy/) doi:10.1007/S10994-015-5484-1

BibTeX

@article{pirotta2015mlj-policy,
  title     = {{Policy Gradient in Lipschitz Markov Decision Processes}},
  author    = {Pirotta, Matteo and Restelli, Marcello and Bascetta, Luca},
  journal   = {Machine Learning},
  year      = {2015},
  pages     = {255-283},
  doi       = {10.1007/S10994-015-5484-1},
  volume    = {100},
  url       = {https://mlanthology.org/mlj/2015/pirotta2015mlj-policy/}
}