ML Anthology
Authors
Search
About
Cao, Zouying
1 publications
AAAI
2025
SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering
Zouying Cao
,
Yifei Yang
,
Hai Zhao