Ma, Jason Yecheng

2 publications

NeurIPS 2022 Offline Goal-Conditioned Reinforcement Learning via $f$-Advantage Regression Jason Yecheng Ma, Jason Yan, Dinesh Jayaraman, Osbert Bastani
NeurIPS 2022 Regret Bounds for Risk-Sensitive Reinforcement Learning Osbert Bastani, Jason Yecheng Ma, Estelle Shen, Wanqiao Xu