Zheng, Zangwei

11 publications

ICML 2025 DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers Xuanlei Zhao, Shenggan Cheng, Chang Chen, Zangwei Zheng, Ziming Liu, Zheming Yang, Yang You
ICML 2025 MERIT: Maximum-Normalized Element-Wise Ratio for Language Model Large-Batch Training Yang Luo, Zangwei Zheng, Ziheng Qin, Zirui Zhu, Yong Liu, Yang You
ECCV 2024 Dataset Growth Ziheng Qin, Zhaopan Xu, YuKun Zhou, Kai Wang, Zangwei Zheng, Zebang Cheng, Hao Tang, Lei Shang, Baigui Sun, Radu Timofte, Xiaojiang Peng, Hongxun Yao, Yang You
ICLR 2024 InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning Ziheng Qin, Kai Wang, Zangwei Zheng, Jianyang Gu, Xiangyu Peng, Zhaopan Xu, Daquan Zhou, Lei Shang, Baigui Sun, Xuansong Xie, Yang You
ICML 2024 OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models Fuzhao Xue, Zian Zheng, Yao Fu, Jinjie Ni, Zangwei Zheng, Wangchunshu Zhou, Yang You
ICML 2023 A Study on Transformer Configuration and Training Objective Fuzhao Xue, Jianghai Chen, Aixin Sun, Xiaozhe Ren, Zangwei Zheng, Xiaoxin He, Yongming Chen, Xin Jiang, Yang You
AAAI 2023 CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU Zangwei Zheng, Pengtai Xu, Xuan Zou, Da Tang, Zhen Li, Chenguang Xi, Peng Wu, Leqi Zou, Yijie Zhu, Ming Chen, Xiangzhuo Ding, Fuzhao Xue, Ziheng Qin, Youlong Cheng, Yang You
ICCV 2023 Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models Zangwei Zheng, Mingyuan Ma, Kai Wang, Ziheng Qin, Xiangyu Yue, Yang You
NeurIPS 2023 Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline Zangwei Zheng, Xiaozhe Ren, Fuzhao Xue, Yang Luo, Xin Jiang, Yang You
NeurIPS 2023 To Repeat or Not to Repeat: Insights from Scaling LLM Under Token-Crisis Fuzhao Xue, Yao Fu, Wangchunshu Zhou, Zangwei Zheng, Yang You
CVPR 2021 Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli