Dong, Yiming

7 publications

NeurIPS 2025 AdaMSS: Adaptive Multi-Subspace Approach for Parameter-Efficient Fine-Tuning Jingjing Zheng, Wanglong Lu, Yiming Dong, Chaojie Ji, Yankai Cao, Zhouchen Lin
NeurIPS 2025 Improving Model Representation and Reducing KV Cache via Skip Connections with First Value Heads Zhoutong Wu, Yuan Zhang, Yiming Dong, Chenheng Zhang, Cong Fang, Kun Yuan, Zhouchen Lin
NeurIPS 2025 On the $O(\frac{\sqrt{d}}{K^{1/4}})$ Convergence Rate of AdamW Measured by $\ell_1$ Norm Huan Li, Yiming Dong, Zhouchen Lin
JMLR 2025 On the O(sqrt(d)/T^(1/4)) Convergence Rate of RMSProp and Its Momentum Extension Measured by L_1 Norm Huan Li, Yiming Dong, Zhouchen Lin
NeurIPS 2025 Stepsize Anything: A Unified Learning Rate Schedule for Budgeted-Iteration Training Anda Tang, Yiming Dong, Yutao Zeng, Zhou Xun, Zhouchen Lin
NeurIPS 2021 Efficient Equivariant Network Lingshen He, Yuxuan Chen, Zhengyang Shen, Yiming Dong, Yisen Wang, Zhouchen Lin
NeurIPS 2021 Gauge Equivariant Transformer Lingshen He, Yiming Dong, Yisen Wang, Dacheng Tao, Zhouchen Lin