Cao, Zouying

1 publications

AAAI 2025 SCANS: Mitigating the Exaggerated Safety for LLMs via Safety-Conscious Activation Steering Zouying Cao, Yifei Yang, Hai Zhao