Zeng, Yongcheng

4 publications

ICLRW 2025 ARIES: Stimulating Self-Refinement of Large Language Models with and for Iterative Preference Optimization Yongcheng Zeng, Xuanfa Jin, Guoqing Liu, Quan He, Dong Li, Jianye Hao, Haifeng Zhang, Jun Wang
ICLRW 2025 Enhancing Mathematical Reasoning in Language Models Through Focused Differentiation Training Zhiyu Zhao, Yongcheng Zeng, Ning Yang, Zihan Zhao, Haifeng Zhang, Jun Wang, Guoqing Liu
NeurIPS 2024 Large Language Models Play StarCraft II:Benchmarks and a Chain of Summarization Approach Weiyu Ma, Qirui Mi, Yongcheng Zeng, Xue Yan, Yuqiao Wu, Runji Lin, Haifeng Zhang, Jun Wang
ICML 2024 Token-Level Direct Preference Optimization Yongcheng Zeng, Guoqing Liu, Weiyu Ma, Ning Yang, Haifeng Zhang, Jun Wang