ZhouXiang

1 publications

ICLR 2025 Earlier Tokens Contribute More: Learning Direct Preference Optimization from Temporal Decay Perspective Ruichen Shao, Bei Li, Gangao Liu, Yang Chen, ZhouXiang, Jingang Wang, Xunliang Cai, Peng Li