ML Anthology
Authors
Search
About
Yang, Xiaohu
1 publications
ICLR
2026
Safety at One Shot: Patching Fine-Tuned LLMs with a Single Instance
Jiawen Zhang
,
Tony He
,
Kejia Chen
,
Jian Lou
,
Jian Liu
,
Xiaohu Yang
,
Ruoxi Jia