Wang, Zhongruo

1 publications

ICLRW 2025 Reinforcement Learning in Inference Time: A Perspective from Successive Policy Iterations Xinnan Zhang, Chenliang Li, Siliang Zeng, Jiaxiang Li, Zhongruo Wang, Songtao Lu, Alfredo Garcia, Mingyi Hong