Wu, Zhoutong

2 publications

NeurIPS 2025. Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads. Zhoutong Wu, Yuan Zhang, Yiming Dong, Chenheng Zhang, Cong Fang, Kun Yuan, Zhouchen Lin.
NeurIPS 2024. Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics. Zhoutong Wu, Yimu Zhang, Cong Fang, Zhouchen Lin.