Gu, Yuzhe

5 publications

ICLR 2026. MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization. Xiangyu Zhao, Junming Lin, Tianhao Liang, Yifan Zhou, Wenhao Chai, Yuzhe Gu, Weiyun Wang, Kai Chen, Gen Luo, Junchi Yan, Wenwei Zhang, Hua Yang, Haodong Duan, Xue Yang
ICLR 2026. The Imitation Game: Turing Machine Imitator Is Length Generalizable Reasoner. Zhouqi Hua, Wenwei Zhang, Chengqi Lyu, Yuzhe Gu, Songyang Gao, Kuikun Liu, Dahua Lin, Kai Chen
ICLR 2025. Mask-DPO: Generalizable Fine-Grained Factuality Alignment of LLMs. Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen
NeurIPS 2025. Semi-Off-Policy Reinforcement Learning for Vision-Language Slow-Thinking Reasoning. Junhao Shen, Haiteng Zhao, Yuzhe Gu, Songyang Gao, Kuikun Liu, Haian Huang, Jianfei Gao, Dahua Lin, Wenwei Zhang, Kai Chen
NeurIPS 2024. ANAH-V2: Scaling Analytical Hallucination Annotation of Large Language Models. Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen