Li, Shuozhe

2 publications

ICLR 2025 An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning Haoran Xu, Shuozhe Li, Harshit Sikchi, Scott Niekum, Amy Zhang
NeurIPS 2025 ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning Ruiyang Zhou, Shuozhe Li, Amy Zhang, Liu Leqi