ML Anthology
Authors
Search
About
Wang, Jianxiang
1 publications
ICLR
2026
Reducing Belief Deviation in Reinforcement Learning for Active Reasoning of LLM Agents
Deyu Zou
,
Yongqiang Chen
,
Jianxiang Wang
,
Garry Yang
,
Mufei Li
,
Qing Da
,
James Cheng
,
Pan Li
,
Yu Gong