Mei, Shaohui

1 publications

NeurIPS 2025 Semantic Representation Attack Against Aligned Large Language Models Jiawei Lian, Jianhong Pan, Lefan Wang, Yi Wang, Shaohui Mei, Lap-Pui Chau