Guo, Yiju

3 publications

ICLR 2026. LaSeR: Reinforcement Learning with Last-Token Self-Rewarding. Wenkai Yang, Weijie Liu, Ruobing Xie, Yiju Guo, Lulu Wu, Saiyong Yang, Yankai Lin
NeurIPS 2025. Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning. Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin
ICLR 2025. Uncertainty and Influence Aware Reward Model Refinement for Reinforcement Learning from Human Feedback. Zexu Sun, Yiju Guo, Yankai Lin, Xu Chen, Qi Qi, Xing Tang, Xiuqiang He, Ji-Rong Wen