He, Weilei

2 publications

ICLR 2025 CREAM: Consistency Regularized Self-Rewarding Language Models Zhaoyang Wang, Weilei He, Zhiyuan Liang, Xuchao Zhang, Chetan Bansal, Ying Wei, Weitong Zhang, Huaxiu Yao
NeurIPSW 2024 Cream: Consistency Regularized Self-Rewarding Language Models Zhaoyang Wang, Weilei He, Zhiyuan Liang, Xuchao Zhang, Chetan Bansal, Ying Wei, Weitong Zhang, Huaxiu Yao