Zhang, Jihang

2 publications

ICLR 2025. HShare: Fast LLM Decoding by Hierarchical Key-Value Sharing. Huaijin Wu, Lianqiang Li, Hantao Huang, Tu Yi, Jihang Zhang, Minghui Yu, Junchi Yan.
NeurIPS 2025. SALS: Sparse Attention in Latent Space for KV Cache Compression. Junlin Mu, Hantao Huang, Jihang Zhang, Minghui Yu, Tao Wang, Yidong Li.