Liu, Zeli

1 publications

NeurIPSW 2024 MoQ: Mixture-of-Format Activation Quantization for Communication-Efficient AI Inference System Haonan Wang, Zeli Liu, Chao Fang, John Paul Walters, Stephen P. Crago