Zeng, Yutao

7 publications

NeurIPS 2025. HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization. Zhijian Zhuo, Yutao Zeng, Ya Wang, Sijun Zhang, Xiaoqing Li, Jian Yang, Zhou Xun, Jinwen Ma
ICLR 2025. Hyper-Connections. Defa Zhu, Hongzhi Huang, Zihao Huang, Yutao Zeng, Yunyao Mao, Banggu Wu, Qiyang Min, Xun Zhou
ICML 2025. Over-Tokenized Transformer: Vocabulary Is Generally Worth Scaling. Hongzhi Huang, Defa Zhu, Banggu Wu, Yutao Zeng, Ya Wang, Qiyang Min, Zhou Xun
ICLR 2025. Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models. Zhijian Zhuo, Ya Wang, Yutao Zeng, Xiaoqing Li, Xun Zhou, Jinwen Ma
ICCV 2025. SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models. Xianfu Cheng, Wei Zhang, Shiwei Zhang, Jian Yang, Xiangyuan Guan, Xianjie Wu, Xiang Li, Ge Zhang, Jiaheng Liu, Yuying Mai, Yutao Zeng, Zhoufutu Wen, Ke Jin, Baorui Wang, Weixiao Zhou, Yunhong Lu, Hangyuan Ji, Tongliang Li, Wenhao Huang, Zhoujun Li
NeurIPS 2025. Stepsize Anything: A Unified Learning Rate Schedule for Budgeted-Iteration Training. Anda Tang, Yiming Dong, Yutao Zeng, Zhou Xun, Zhouchen Lin
ICLR 2025. Ultra-Sparse Memory Network. Zihao Huang, Qiyang Min, Hongzhi Huang, Yutao Zeng, Defa Zhu, Ran Guo, Zhou Xun