ML Anthology
Authors
Search
About
Qiao, Ruixi
2 publications
ICLR
2025
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining
Jie Cheng
,
Ruixi Qiao
,
Yingwei Ma
,
Binhua Li
,
Gang Xiong
,
Qinghai Miao
,
Yongbin Li
,
Yisheng Lv
NeurIPS
2025
Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning
Jie Cheng
,
Gang Xiong
,
Ruixi Qiao
,
Lijun Li
,
Chao Guo
,
Junle Wang
,
Yisheng Lv
,
Fei-Yue Wang