ML Anthology
Yu, Jiangyong
3 publications
ICLR 2026
NLI: Non-Uniform Linear Interpolation Approximation of Nonlinear Operations for Efficient LLMs Inference
Jiangyong Yu, Xiaomeng Han, Xing Hu, Chen Xu, Zhe Jiang, Dawei Yang
ICML 2025
MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance
Zhixuan Chen, Xing Hu, Dawei Yang, Zukang Xu, Xu Chen, Zhihang Yuan, Sifan Zhou, Jiangyong Yu
ICML 2025
RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization
Chen Xu, Yuxuan Yue, Zukang Xu, Xing Hu, Jiangyong Yu, Zhixuan Chen, Sifan Zhou, Zhihang Yuan, Dawei Yang