Hong, Mao

3 publications

ICLR 2024 A Policy Gradient Method for Confounded POMDPs Mao Hong, Zhengling Qi, Yanxun Xu
TMLR 2024 MoMA: Model-Based Mirror Ascent for Offline Reinforcement Learning Mao Hong, Zhiyue Zhang, Yue Wu, Yanxun Xu
ICML 2024 Model-Based Reinforcement Learning for Confounded POMDPs Mao Hong, Zhengling Qi, Yanxun Xu