A New Challenge in Policy Evaluation

Abstract

This paper proposes a new challenge in policy evaluation: using information extracted from offline data to improve the online data efficiency of Monte Carlo methods while preserving their unbiasedness.

Cite

Text

Zhang. "A New Challenge in Policy Evaluation." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I13.26832

Markdown

[Zhang. "A New Challenge in Policy Evaluation." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/zhang2023aaai-new/) doi:10.1609/AAAI.V37I13.26832

BibTeX

@inproceedings{zhang2023aaai-new,
  title     = {{A New Challenge in Policy Evaluation}},
  author    = {Zhang, Shangtong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {15465},
  doi       = {10.1609/AAAI.V37I13.26832},
  url       = {https://mlanthology.org/aaai/2023/zhang2023aaai-new/}
}