Li, Yafu

3 publications

NeurIPS 2025. Learning to Reason Under Off-Policy Guidance. Jianhao Yan, Yafu Li, Zican Hu, Zhi Wang, Ganqu Cui, Xiaoye Qu, Yu Cheng, Yue Zhang
ICML 2025. Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback. Yafu Li, Xuyang Hu, Xiaoye Qu, Linjie Li, Yu Cheng
ICLR 2024. Understanding In-Context Learning from Repetitions. Jianhao Yan, Jin Xu, Chiyu Song, Chenming Wu, Yafu Li, Yue Zhang