ML Anthology
Authors
Search
About
Ye, Jiaquan
1 publications
ICLR
2025
Taming Transformer Without Using Learning Rate Warmup
Xianbiao Qi
,
Yelin He
,
Jiaquan Ye
,
Chun-Guang Li
,
Bojia Zi
,
Xili Dai
,
Qin Zou
,
Rong Xiao