Miao, Xupeng

9 publications

ICML 2025 Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs Youhe Jiang, Fangcheng Fu, Xiaozhe Yao, Guoliang He, Xupeng Miao, Ana Klimovic, Bin Cui, Binhang Yuan, Eiko Yoneki
ICLR 2025 NetMoE: Accelerating MoE Training Through Dynamic Sample Placement Xinyi Liu, Yujie Wang, Fangcheng Fu, Xupeng Miao, Shenhan Zhu, Xiaonan Nie, Bin Cui
AAAI 2024 Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui
NeurIPS 2024 LSH-MoE: Communication-Efficient MoE Training via Locality-Sensitive Hashing Xiaonan Nie, Qibin Liu, Fangcheng Fu, Shenhan Zhu, Xupeng Miao, Xiaoyang Li, Yang Zhang, Shouda Liu, Bin Cui
IJCAI 2024 X-Former Elucidator: Reviving Efficient Attention for Long Context Language Modeling Xupeng Miao, Shenhan Zhu, Fangcheng Fu, Ziyu Guo, Zhi Yang, Yaofeng Tu, Zhihao Jia, Bin Cui
AAAI 2023 CALIP: Zero-Shot Enhancement of CLIP with Parameter-Free Attention Ziyu Guo, Renrui Zhang, Longtian Qiu, Xianzheng Ma, Xupeng Miao, Xuming He, Bin Cui
NeurIPS 2023 Model-Enhanced Vector Index Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui
IJCAI 2023 OSDP: Optimal Sharded Data Parallel for Distributed Deep Learning Youhe Jiang, Fangcheng Fu, Xupeng Miao, Xiaonan Nie, Bin Cui
CVPR 2022 PointCLIP: Point Cloud Understanding by CLIP Renrui Zhang, Ziyu Guo, Wei Zhang, Kunchang Li, Xupeng Miao, Bin Cui, Yu Qiao, Peng Gao, Hongsheng Li