Sun, Ruopei

1 publications

ICLR 2026 Disentangling Length Bias in Preference Learning via Response-Conditioned Modeling Jianfeng Cai, Jinhua Zhu, Ruopei Sun, Yue Wang, Li Li, Wengang Zhou, Houqiang Li