Wang, Qianle

1 publications

NeurIPS 2024 Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models Wanyun Cui, Qianle Wang