XiaoLong, Hu

1 publication

ICLR 2026. Detecting Data Contamination from Reinforcement Learning Post-Training for Large Language Models. Yongding Tao, Tian Wang, Yihong Dong, Huanyu Liu, Kechi Zhang, Hu XiaoLong, Ge Li