ML Anthology
Authors
Search
About
Yi, Biao
1 publications
ICLR
2025
Probe Before You Talk: Towards Black-Box Defense Against Backdoor Unalignment for Large Language Models
Biao Yi
,
Tiansheng Huang
,
Sishuo Chen
,
Tong Li
,
Zheli Liu
,
Zhixuan Chu
,
Yiming Li