Huynh, Tran

4 publications

ICLR 2026. Adversarial Déjà Vu: Jailbreak Dictionary Learning for Stronger Generalization to Unseen Attacks. Mahavir Dabas, Tran Huynh, Nikhil Reddy Billa, Jiachen T. Wang, Peng Gao, Charith Peris, Yao Ma, Rahul Gupta, Ming Jin, Prateek Mittal, Ruoxi Jia
ICLR 2026. Inference-Time Personalized Safety Control via Paired Difference-in-Means Intervention. Tran Huynh, Ruoxi Jia
AAAI 2024. COMBAT: Alternated Training for Effective Clean-Label Backdoor Attacks. Tran Huynh, Dang Nguyen, Tung Pham, Anh Tran
ECCV 2024. Data Poisoning Quantization Backdoor Attack. Tran Huynh, Anh Tran, Khoa Doan, Tung Pham