He, Guanzhong

1 publication

ICLR 2026. WebSeer: Training Deeper Search Agents Through Reinforcement Learning with Self-Reflection. Guanzhong He, Zhen Yang, Jinxin Liu, Bin Xu, Lei Hou, Juanzi Li.