Jiang, Yikun

1 publications

NeurIPS 2024 D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models Yikun Jiang, Huanyu Wang, Lei Xie, Hanbin Zhao, Chao Zhang, Hui Qian, John C.S. Lui