Li, Sibo

1 publications

ICLR 2026 RL of Thoughts: Navigating LLM Reasoning with Inference-Time Reinforcement Learning Qianyue Hao, Sibo Li, Jian Yuan, Yong Li