Gu, Changwei

1 publication

NeurIPS 2025. Sim-LLM: Optimizing LLM Inference at the Edge Through Inter-Task KV Reuse. Ruikun Luo, Changwei Gu, Qiang He, Feifei Chen, Song Wu, Hai Jin, Yun Yang