Wang, Yuanfu

2 publications

ICLR 2026 Native Reasoning Models: Training Language Models to Reason on Unverifiable Data Yuanfu Wang, Zhixuan Liu, Li Xiangtian, Chaochao Lu, Chao Yang
AAAI 2024 Critic-Guided Decision Transformer for Offline Reinforcement Learning Yuanfu Wang, Chao Yang, Ying Wen, Yu Liu, Yu Qiao