Zhu, Zihao
12 publications
ICLR
2026
AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models
ICLR
2026
Reliable Poisoned Sample Detection Against Backdoor Attacks Enhanced by Sharpness Aware Minimization
12 publications