Wan, Fanqi

4 publications

ICLR 2025 Advantage-Guided Distillation for Preference Alignment in Small Language Models Shiping Gao, Fanqi Wan, Jiajian Guo, Xiaojun Quan, Qifan Wang
AAAI 2025 Empowering Self-Learning of LLMs: Inner Knowledge Explicitation as a Catalyst Shijue Huang, Wanjun Zhong, Deng Cai, Fanqi Wan, Chengyi Wang, Mingxuan Wang, Mu Qiao, Ruifeng Xu
ICLR 2025 Weighted-Reward Preference Optimization for Implicit Model Fusion Ziyi Yang, Fanqi Wan, Longguang Zhong, Tianyuan Shi, Xiaojun Quan
ICLR 2024 Knowledge Fusion of Large Language Models Fanqi Wan, Xinting Huang, Deng Cai, Xiaojun Quan, Wei Bi, Shuming Shi