Liu, Xueshen

2 publications

ICML 2025 Compute or Load KV Cache? Why Not Both? Shuowei Jin, Xueshen Liu, Qingzhao Zhang, Zhuoqing Mao
NeurIPS 2024 Learn to Be Efficient: Build Structured Sparsity in Large Language Models Haizhong Zheng, Xiaoyan Bai, Xueshen Liu, Z. Morley Mao, Beidi Chen, Fan Lai, Atul Prakash