Li, Ruixiao

2 publications

ICLR 2025. ReAttention: Training-Free Infinite Context with Finite Attention Scope. Xiaoran Liu, Ruixiao Li, Zhigeng Liu, Qipeng Guo, Yuerong Song, Kai Lv, Hang Yan, Linlin Li, Qun Liu, Xipeng Qiu
IJCAI 2025. Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency. Ruixiao Li, Fahao Chen, Peng Li