ML Anthology
Authors
Search
About
Huang, Hantao
2 publications
ICLR
2025
HShare: Fast LLM Decoding by Hierarchical Key-Value Sharing
Huaijin Wu
,
Lianqiang Li
,
Hantao Huang
,
Tu Yi
,
Jihang Zhang
,
Minghui Yu
,
Junchi Yan
NeurIPS
2025
SALS: Sparse Attention in Latent Space for KV Cache Compression
Junlin Mu
,
Hantao Huang
,
Jihang Zhang
,
Minghui Yu
,
Tao Wang
,
Yidong Li