ML Anthology
Xing, Zeyu
2 publications
ICLR 2026
Beyond Speedup - Utilizing KV Cache for Sampling and Reasoning
Zeyu Xing, Xing Li, Hui-Ling Zhen, Mingxuan Yuan, Sinno Jialin Pan
ICML 2025
KVTuner: Sensitivity-Aware Layer-Wise Mixed-Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
Xing Li, Zeyu Xing, Yiming Li, Linping Qu, Hui-Ling Zhen, Yiwu Yao, Wulong Liu, Sinno Jialin Pan, Mingxuan Yuan