Huang, Pohao

1 publications

ICLR 2026 RL Grokking Recipe: How Does RL Unlock and Transfer New Algorithms in LLMs? Yiyou Sun, Yuhan Cao, Pohao Huang, Haoyue Bai, Hannaneh Hajishirzi, Nouha Dziri, Dawn Song