Sheen, Heejune

3 publications

ICLR 2026 Taming Polysemanticity in LLMs: Theory-Grounded Feature Recovery via Sparse Autoencoders Siyu Chen, Heejune Sheen, Xuyuan Xiong, Tianhao Wang, Zhuoran Yang
NeurIPS 2024 Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang
ICMLW 2024 Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers Siyu Chen, Heejune Sheen, Tianhao Wang, Zhuoran Yang