ML Anthology
Meng, Chen
3 publications
ICLRW 2024
Distributed Inference Performance Optimization for LLMs on CPUs
Pujiang He, Shan Zhou, Changqing Li, Wenhuan Huang, Weifei Yu, Duyi Wang, Chen Meng, Sheng Gui
ICMLW 2024
Inference Performance Optimization for Large Language Models on CPUs
Pujiang He, Shan Zhou, Wenhuan Huang, Changqing Li, Duyi Wang, Bin Guo, Chen Meng, Sheng Gui, Weifei Yu, Yi Xie
CVPRW 2018
Efficient Deep Learning Inference Based on Model Compression
Qing Zhang, Mengru Zhang, Mengdi Wang, Wanchen Sui, Chen Meng, Jun Yang, Weidan Kong, Xiaoyuan Cui, Wei Lin