Sun, Yuting

1 publication

TMLR 2025. Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model. Yueqin Yin, Shentao Yang, Yujia Xie, Ziyi Yang, Yuting Sun, Hany Hassan Awadalla, Weizhu Chen, Mingyuan Zhou