Yang, Xiaohu

1 publications

ICLR 2026 Safety at One Shot: Patching Fine-Tuned LLMs with a Single Instance Jiawen Zhang, Tony He, Kejia Chen, Jian Lou, Jian Liu, Xiaohu Yang, Ruoxi Jia