Song, Kefan

1 publication

ICLR 2026 Reward Is Enough: LLMs Are In-Context Reinforcement Learners Kefan Song, Amir Moeini, Peng Wang, Lei Gong, Rohan Chandra, Shangtong Zhang, Yanjun Qi