Siyu, Chen

1 publication

COLT 2024 Training Dynamics of Multi-Head SoftMax Attention for In-Context Learning: Emergence, Convergence, and Optimality (Extended Abstract) Chen Siyu, Sheen Heejune, Wang Tianhao, Yang Zhuoran