ML Anthology
Authors
Search
About
Jiang, Yikun
1 publications
NeurIPS
2024
D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models
Yikun Jiang
,
Huanyu Wang
,
Lei Xie
,
Hanbin Zhao
,
Chao Zhang
,
Hui Qian
,
John C.S. Lui