Zou, Menglin

1 publications

NeurIPS 2025 Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm Yang Chen, Menglin Zou, Jiaqi Zhang, Yitan Zhang, Junyi Yang, Gael Gendron, Libo Zhang, Jiamou Liu, Michael J. Witbrock