Zhao, Weilin

4 publications

ICLR 2026 InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation Weilin Zhao, Zihan Zhou, Zhou Su, Chaojun Xiao, Yuxuan Li, Yanghao Li, Yudi Zhang, Weilun Zhao, Zhen Li, Yuxiang Huang, Ao Sun, Xu Han, Zhiyuan Liu
ICLR 2024 Predicting Emergent Abilities with Infinite Resolution Evaluation Shengding Hu, Xin Liu, Xu Han, Xinrong Zhang, Chaoqun He, Weilin Zhao, Yankai Lin, Ning Ding, Zebin Ou, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
NeurIPS 2023 H3T: Efficient Integration of Memory Optimization and Parallelism for Large-Scale Transformer Training Yuzhong Wang, Xu Han, Weilin Zhao, Guoyang Zeng, Zhiyuan Liu, Maosong Sun
NeurIPS 2022 Moderate-Fitting as a Natural Backdoor Defender for Pre-Trained Language Models Biru Zhu, Yujia Qin, Ganqu Cui, Yangyi Chen, Weilin Zhao, Chong Fu, Yangdong Deng, Zhiyuan Liu, Jingang Wang, Wei Wu, Maosong Sun, Ming Gu