Cao, Yuhan

2 publications

ICLR 2026. RL Grokking Recipe: How Does RL Unlock and Transfer New Algorithms in LLMs? Yiyou Sun, Yuhan Cao, Pohao Huang, Haoyue Bai, Hannaneh Hajishirzi, Nouha Dziri, Dawn Song.
AAAI 2024. Double Auction on Diffusion Network. Miao Li, Yuhan Cao, Dengji Zhao.