ML Anthology
Yang, Lijie
2 publications
ICLR 2025
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Lijie Yang, Zhihao Zhang, Zhuofu Chen, Zikun Li, Zhihao Jia
ICML 2024
Accelerating Iterative Retrieval-Augmented Language Model Serving with Speculation
Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia