Lu, Baotong

2 publications

NeurIPS 2025. RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval. Di Liu, Meng Chen, Baotong Lu, Huiqiang Jiang, Zhenhua Han, Qianxi Zhang, Qi Chen, Chengruidong Zhang, Bailu Ding, Kai Zhang, Chen Chen, Fan Yang, Yuqing Yang, Lili Qiu.
AAAI 2020. High Performance Depthwise and Pointwise Convolutions on Mobile Devices. Pengfei Zhang, Eric Lo, Baotong Lu.