Zeng, Weihao

5 publications

ICLR 2025 AgentRefine: Enhancing Agent Generalization Through Refinement Tuning Dayuan Fu, Keqing He, Yejie Wang, Wentao Hong, Zhuoma GongQue, Weihao Zeng, Wei Wang, Jingang Wang, Xunliang Cai, Weiran Xu
ICLR 2025 B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Weihao Zeng, Yuzhen Huang, Lulu Zhao, Yijun Wang, Zifei Shan, Junxian He
ICLR 2025 CS-Bench: A Comprehensive Benchmark for Large Language Models Towards Computer Science Mastery Xiaoshuai Song, Muxi Diao, Guanting Dong, Zhengyang Wang, Yujia Fu, Runqi Qiao, Zhexu Wang, Dayuan Fu, Huangxuan Wu, Bin Liang, Weihao Zeng, Yejie Wang, Zhuoma GongQue, Jianing Yu, Qiuna Tan, Weiran Xu
AAAI 2025 CareBot: A Pioneering Full-Process Open-Source Medical Language Model Lulu Zhao, Weihao Zeng, Xiaofeng Shi, Hua Zhou
ICLR 2024 What Makes Good Data for Alignment? a Comprehensive Study of Automatic Data Selection in Instruction Tuning Wei Liu, Weihao Zeng, Keqing He, Yong Jiang, Junxian He