Qiu, Lirong

2 publications

IJCAI 2025 Feint and Attack: Jailbreaking and Protecting LLMs via Attention Distribution Modeling Rui Pu, Chaozhuo Li, Rui Ha, Zejian Chen, Litian Zhang, Zheng Liu, Lirong Qiu, Zaisheng Ye
NeurIPS 2025 OSTAR: Optimized Statistical Text-Classifier with Adversarial Resistance Yuhan Yao, Feifei Kou, Lei Shi, Xiao Yang, Zhongbao Zhang, Suguo Zhu, Jiwei Zhang, Lirong Qiu, Li Haisheng