Hu, Zhicheng

1 publications

ACML 2025 Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference Yaohua Tang, Zhicheng Hu, Kun Cheng, Fan Mo, Qiheng Lyu, Hua Wang, Zhi Chen