Chen, Yanfeng

2 publications

ICLR 2026 Fast Data Mixture Optimization via Gradient Descent Haoru Tan, Sitong Wu, Yanfeng Chen, Jun Xia, Ruobing Xie, Bin Xia, Xingwu Sun, Xiaojuan Qi
NeurIPS 2025 Understanding Data Influence in Reinforcement Finetuning Haoru Tan, Xiuzhe Wu, Sitong Wu, Shaofeng Zhang, Yanfeng Chen, Xingwu Sun, Jeanne Shen, Xiaojuan Qi