Li, Lanting

1 publications

ICML 2024 Accelerating Iterative Retrieval-Augmented Language Model Serving with Speculation Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia