Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Cite

Text

Luo et al. "Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling." International Conference on Learning Representations, 2020.

Markdown

[Luo et al. "Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/luo2020iclr-learning/)

BibTeX

@inproceedings{luo2020iclr-learning,
  title     = {{Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling}},
  author    = {Luo, Yuping and Xu, Huazhe and Ma, Tengyu},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/luo2020iclr-learning/}
}