Baumann, Thomas

1 publication

NeurIPSW 2024 Universal Jailbreak Backdoors in Large Language Model Alignment Thomas Baumann