Luo, Tongxu

1 publications

NeurIPS 2024 Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu