ML Anthology
Guo, Yiju
3 publications
ICLR
2026
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
Wenkai Yang, Weijie Liu, Ruobing Xie, Yiju Guo, Lulu Wu, Saiyong Yang, Yankai Lin
NeurIPS
2025
Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning
Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin
ICLR
2025
Uncertainty and Influence Aware Reward Model Refinement for Reinforcement Learning from Human Feedback
Zexu Sun, Yiju Guo, Yankai Lin, Xu Chen, Qi Qi, Xing Tang, Xiuqiang He, Ji-Rong Wen