Wang, Jianxiang

1 publication

ICLR 2026. Reducing Belief Deviation in Reinforcement Learning for Active Reasoning of LLM Agents. Deyu Zou, Yongqiang Chen, Jianxiang Wang, Garry Yang, Mufei Li, Qing Da, James Cheng, Pan Li, Yu Gong