ML Anthology
Authors
Search
About
He, Weilei
2 publications
ICLR
2025
CREAM: Consistency Regularized Self-Rewarding Language Models
Zhaoyang Wang
,
Weilei He
,
Zhiyuan Liang
,
Xuchao Zhang
,
Chetan Bansal
,
Ying Wei
,
Weitong Zhang
,
Huaxiu Yao
NeurIPSW
2024
Cream: Consistency Regularized Self-Rewarding Language Models
Zhaoyang Wang
,
Weilei He
,
Zhiyuan Liang
,
Xuchao Zhang
,
Chetan Bansal
,
Ying Wei
,
Weitong Zhang
,
Huaxiu Yao