Xu, Zhanchao

1 publication

TMLR 2025. A Survey on Large Language Model Acceleration Based on KV Cache Management. Haoyang Li, Yiming Li, Anxin Tian, Tianhao Tang, Zhanchao Xu, Xuejia Chen, Nicole Hu, Wei Dong, Li Qing, Lei Chen