Fan, Zhizhen

1 publication

ICLR 2025. Self-Evolved Reward Learning for LLMs. Chenghua Huang, Zhizhen Fan, Lu Wang, Fangkai Yang, Pu Zhao, Zeqi Lin, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang