Tu, Zhaopeng

31 publications

ICLR 2025 Competing Large Language Models in Multi-Agent Gaming Environments Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael Lyu
ICML 2025 Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability Zicheng Lin, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xing Wang, Ruilin Luo, Chufan Shi, Siheng Li, Yujiu Yang, Zhaopeng Tu
ICML 2025 Do NOT Think That Much for 2+3=? on the Overthinking of Long Reasoning Models Xingyu Chen, Jiahao Xu, Tian Liang, Zhiwei He, Jianhui Pang, Dian Yu, Linfeng Song, Qiuzhi Liu, Mengfei Zhou, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu
ICLR 2025 Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models Yongxin Guo, Zhenglin Cheng, Xiaoying Tang, Zhaopeng Tu, Tao Lin
ICLR 2025 RaSA: Rank-Sharing Low-Rank Adaptation Zhiwei He, Zhaopeng Tu, Xing Wang, Xingyu Chen, Zhijie Wang, Jiahao Xu, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang
NeurIPS 2025 SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning Yuyang Ding, Xinyu Shi, Juntao Li, Xiaobo Liang, Zhaopeng Tu, Min Zhang
NeurIPS 2025 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning Jiaqi Chen, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong
NeurIPS 2025 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji, Jiahao Xu, Tian Liang, Qiuzhi Liu, Zhiwei He, Xiaoyuan Liu, Xingyu Chen, Junying Chen, Benyou Wang, Zhaopeng Tu, Haitao Mi, Dong Yu
NeurIPS 2025 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement Ruihan Yang, Fanghua Ye, Jian Li, Siyu Yuan, Yikai Zhang, Zhaopeng Tu, Xiaolong Li, Deqing Yang
NeurIPS 2025 Thoughts Are All over the Place: On the Underthinking of Long Reasoning Models Yue Wang, Qiuzhi Liu, Jiahao Xu, Tian Liang, Xingyu Chen, Zhiwei He, Linfeng Song, Dian Yu, Juntao Li, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu
NeurIPS 2025 Trust, but Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards Xiaoyuan Liu, Tian Liang, Zhiwei He, Jiahao Xu, Wenxuan Wang, Pinjia He, Zhaopeng Tu, Haitao Mi, Dong Yu
NeurIPS 2025 Two Experts Are All You Need for Steering Thinking: Reinforcing Cognitive Effort in MoE Reasoning Models Without Additional Training Mengru Wang, Xingyu Chen, Yue Wang, Zhiwei He, Jiahao Xu, Tian Liang, Qiuzhi Liu, Yunzhi Yao, Wenxuan Wang, Ruotian Ma, Haitao Mi, Ningyu Zhang, Zhaopeng Tu, Xiaolong Li, Dong Yu
NeurIPS 2024 Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans Jen-tse Huang, Man Ho Lam, Eric John Li, Shujie Ren, Wenxuan Wang, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu
NeurIPS 2024 Benchmarking LLMs via Uncertainty Quantification Fanghua Ye, Mingming Yang, Jianhui Pang, Longyue Wang, Derek F. Wong, Emine Yilmaz, Shuming Shi, Zhaopeng Tu
ICLR 2024 GPT-4 Is Too Smart to Be Safe: Stealthy Chat with LLMs via Cipher Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu
ICML 2024 GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You
NeurIPS 2024 NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates Hexuan Deng, Wenxiang Jiao, Xuebo Liu, Min Zhang, Zhaopeng Tu
ICLR 2024 On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael Lyu
ICML 2021 Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation Cunxiao Du, Zhaopeng Tu, Jing Jiang
ICLR 2021 Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Zhaopeng Tu
ICLR 2021 Understanding and Improving Lexical Choice in Non-Autoregressive Translation Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao, Zhaopeng Tu
IJCAI 2020 Auxiliary Template-Enhanced Generative Compatibility Modeling Jinhuan Liu, Xuemeng Song, Zhaochun Ren, Liqiang Nie, Zhaopeng Tu, Jun Ma
AAAI 2020 Go from the General to the Particular: Multi-Domain Translation with Domain Transformation Networks Yong Wang, Longyue Wang, Shuming Shi, Victor O. K. Li, Zhaopeng Tu
AAAI 2020 Neuron Interaction Based Representation Composition for Neural Machine Translation Jian Li, Xing Wang, Baosong Yang, Shuming Shi, Michael R. Lyu, Zhaopeng Tu
AAAI 2019 Context-Aware Self-Attention Networks Baosong Yang, Jian Li, Derek F. Wong, Lidia S. Chao, Xing Wang, Zhaopeng Tu
AAAI 2019 Dynamic Layer Aggregation for Neural Machine Translation with Routing-by-Agreement Zi-Yi Dou, Zhaopeng Tu, Xing Wang, Longyue Wang, Shuming Shi, Tong Zhang
AAAI 2019 Neural Machine Translation with Adequacy-Oriented Learning Xiang Kong, Zhaopeng Tu, Shuming Shi, Eduard H. Hovy, Tong Zhang
IJCAI 2018 Neural Machine Translation with Key-Value Memory-Augmented Attention Fandong Meng, Zhaopeng Tu, Yong Cheng, Haiyang Wu, Junjie Zhai, Yuekui Yang, Di Wang
AAAI 2018 Translating Pro-Drop Languages with Reconstruction Models Longyue Wang, Zhaopeng Tu, Shuming Shi, Tong Zhang, Yvette Graham, Qun Liu
AAAI 2017 Neural Machine Translation Advised by Statistical Machine Translation Xing Wang, Zhengdong Lu, Zhaopeng Tu, Hang Li, Deyi Xiong, Min Zhang
AAAI 2017 Neural Machine Translation with Reconstruction Zhaopeng Tu, Yang Liu, Lifeng Shang, Xiaohua Liu, Hang Li