Li, Yucheng

12 publications

ICLRW 2025 Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation Yucheng Li
ICML 2025 MMInference: Accelerating Pre-Filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu
ICLRW 2025 MMInference: Accelerating Pre-Filling for Long-Context Visual Language Models via Modality-Aware Permutation Sparse Attention Yucheng Li, Huiqiang Jiang, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu
NeurIPS 2025 R-KV: Redundancy-Aware KV Cache Compression for Reasoning Models Zefan Cai, Wen Xiao, Hanshi Sun, Cheng Luo, Yikai Zhang, Ke Wan, Yucheng Li, Yeyang Zhou, Li-Wen Chang, Jiuxiang Gu, Zhen Dong, Anima Anandkumar, Abedelkadir Asi, Junjie Hu
ICLR 2025 SCBench: A KV Cache-Centric Analysis of Long-Context Methods Yucheng Li, Huiqiang Jiang, Qianhui Wu, Xufang Luo, Surin Ahn, Chengruidong Zhang, Amir H. Abdi, Dongsheng Li, Jianfeng Gao, Yuqing Yang, Lili Qiu
AAAI 2024 LatestEval: Addressing Data Contamination in Language Model Evaluation Through Dynamic and Time-Sensitive Test Construction Yucheng Li, Frank Guerin, Chenghua Lin
NeurIPS 2024 MInference 1.0: Accelerating Pre-Filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu
ICMLW 2024 MInference: Accelerating Pre-Filling for Long-Context LLMs via Dynamic Sparse Attention Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu
NeurIPS 2022 Cache-Augmented Inbatch Importance Resampling for Training Recommender Retriever Jin Chen, Defu Lian, Yucheng Li, Baoyun Wang, Kai Zheng, Enhong Chen
IJCAI 2021 Consistent Inference for Dialogue Relation Extraction Xinwei Long, Shuzi Niu, Yucheng Li
ECML-PKDD 2016 Efficient Bayesian Maximum Margin Multiple Kernel Learning Changying Du, Changde Du, Guoping Long, Xin Jin, Yucheng Li
UAI 2016 Online Bayesian Multiple Kernel Bipartite Ranking Changying Du, Changde Du, Guoping Long, Qing He, Yucheng Li