Meng, Chen

3 publications

ICLRW 2024 Distributed Inference Performance Optimization for LLMs on CPUs Pujiang He, Shan Zhou, Changqing Li, Wenhuan Huang, Weifei Yu, Duyi Wang, Chen Meng, Sheng Gui
ICMLW 2024 Inference Performance Optimization for Large Language Models on CPUs Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie
CVPRW 2018 Efficient Deep Learning Inference Based on Model Compression Qing Zhang, Mengru Zhang, Mengdi Wang, Wanchen Sui, Chen Meng, Jun Yang, Weidan Kong, Xiaoyuan Cui, Wei Lin