Guo, Jiajian

1 publication

ICLR 2025 Advantage-Guided Distillation for Preference Alignment in Small Language Models Shiping Gao, Fanqi Wan, Jiajian Guo, Xiaojun Quan, Qifan Wang