Liu, Jiate

1 publications

TMLR 2023 RLTF: Reinforcement Learning from Unit Test Feedback Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Yang Wei, Deheng Ye