Han, Jinyi

4 publications

ICLR 2026 A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models Jinyi Han, Xinyi Wang, Haiquan Zhao, Tingyun Li, Zishang Jiang, Sihang Jiang, Jiaqing Liang, Xin Alex Lin, Weikang Zhou, Zeye Sun, Fei Yu, Yanghua Xiao
ICLR 2026 Selective Expert Guidance for Effective and Diverse Exploration in Reinforcement Learning of LLMs Zishang Jiang, Jinyi Han, Tingyun Li, Xinyi Wang, Sihang Jiang, Zhaoqian Dai, Ma Shuguang, Fei Yu, Jiaqing Liang, Yanghua Xiao
ICLR 2026 Your Models Have Thought Enough: Training Large Reasoning Models to Stop Overthinking Jinyi Han, Ying Huang, Ying Liao, Haiquan Zhao, Zishang Jiang, Xinyi Wang, Xikun Lu, Guanghao Zhou, Sihang Jiang, Jiaqing Liang, Weikang Zhou, Zeye Sun, Fei Yu, Yanghua Xiao
ICLR 2025 Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models Chengyu Du, Jinyi Han, Yizhou Ying, Aili Chen, Qianyu He, Haokun Zhao, Haoran Guo, Sirui Xia, Jiaqing Liang, Zulong Chen, Liangyue Li, Yanghua Xiao