Lyu, Haoming

1 publications

ICLR 2026 SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward Kaixuan Fan, Kaituo Feng, Haoming Lyu, Dongzhan Zhou, Xiangyu Yue