Zhou, Ruiyang

2 publications

NeurIPS 2025 ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning Ruiyang Zhou, Shuozhe Li, Amy Zhang, Liu Leqi
NeurIPSW 2024 Personalized Language Modeling from Personalized Human Feedback Xinyu Li, Ruiyang Zhou, Zachary Chase Lipton, Liu Leqi