Luo et al. "Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling." International Conference on Learning Representations, 2020.
Markdown
[Luo et al. "Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/luo2020iclr-learning/)
BibTeX
@inproceedings{luo2020iclr-learning,
title = {{Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling}},
author = {Luo, Yuping and Xu, Huazhe and Ma, Tengyu},
booktitle = {International Conference on Learning Representations},
year = {2020},
url = {https://mlanthology.org/iclr/2020/luo2020iclr-learning/}
}