Guokang, Gao

1 publications

ICLRW 2025 Decision Preference Alignment for Large-Scale Agents Based on Reward Model Generation Zheng Jiaoling, Xu Weifeng, Luo Qian, Dang Wanli, Geng Long, Gao Guokang, Ren Yulin, Fan Xingyu