Li, Yige

11 publications

CVPR 2025 AnyAttack: Towards Large-Scale Self-Supervised Adversarial Attacks on Vision-Language Models Jiaming Zhang, Junhong Ye, Xingjun Ma, Yige Li, Yunfan Yang, Yunhao Chen, Jitao Sang, Dit-Yan Yeung
AAAI 2025 Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models Peihai Jiang, Xixiang Lyu, Yige Li, Jing Ma
NeurIPS 2025 BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models Yige Li, Hanxun Huang, Yunhan Zhao, Xingjun Ma, Jun Sun
ICLR 2025 BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks Yunhan Zhao, Xiang Zheng, Lin Luo, Yige Li, Xingjun Ma, Yu-Gang Jiang
ICML 2025 CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization Nay Myat Min, Long H. Pham, Yige Li, Jun Sun
ICLR 2025 Detecting Backdoor Samples in Contrastive Language Image Pretraining Hanxun Huang, Sarah Monazam Erfani, Yige Li, Xingjun Ma, James Bailey
NeurIPS 2025 Memory Injection Attacks on LLM Agents via Query-Only Interaction Shen Dong, Shaochen Xu, Pengfei He, Yige Li, Jiliang Tang, Tianming Liu, Hui Liu, Zhen Xiang
ICML 2025 X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP Hanxun Huang, Sarah Monazam Erfani, Yige Li, Xingjun Ma, James Bailey
ICML 2023 Reconstructive Neuron Pruning for Backdoor Defense Yige Li, Xixiang Lyu, Xingjun Ma, Nodens Koren, Lingjuan Lyu, Bo Li, Yu-Gang Jiang
NeurIPS 2021 Anti-Backdoor Learning: Training Clean Models on Poisoned Data Yige Li, Xixiang Lyu, Nodens Koren, Lingjuan Lyu, Bo Li, Xingjun Ma
ICLR 2021 Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks Yige Li, Xixiang Lyu, Nodens Koren, Lingjuan Lyu, Bo Li, Xingjun Ma