ML Anthology
Authors
Search
About
Mei, Shaohui
1 publications
NeurIPS
2025
Semantic Representation Attack Against Aligned Large Language Models
Jiawei Lian
,
Jianhong Pan
,
Lefan Wang
,
Yi Wang
,
Shaohui Mei
,
Lap-Pui Chau