ML Anthology
Yu, Jiangyong
2 publications
ICML 2025
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Zhixuan Chen, Xing Hu, Dawei Yang, Zukang Xu, Xu Chen, Zhihang Yuan, Sifan Zhou, Jiangyong Yu
ICML 2025
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
Chen Xu, Yuxuan Yue, Zukang Xu, Xing Hu, Jiangyong Yu, Zhixuan Chen, Sifan Zhou, Zhihang Yuan, Dawei Yang