Chen, Xingwu

4 publications

NeurIPS 2025 On the Robustness of Transformers Against Context Hijacking for Linear Classification Tianle Li, Chenyang Zhang, Xingwu Chen, Yuan Cao, Difan Zou
NeurIPS 2024 How Transformers Utilize Multi-Head Attention in In-Context Learning? a Case Study on Sparse Linear Regression Xingwu Chen, Lei Zhao, Difan Zou
ICMLW 2024 How Transformers Utilize Multi-Head Attention in In-Context Learning? a Case Study on Sparse Linear Regression Xingwu Chen, Lei Zhao, Difan Zou
ICML 2024 What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks Xingwu Chen, Difan Zou