Kirch, Nathalie Maria

2 publications

NeurIPSW 2024. TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty Simulations. Nathalie Maria Kirch, Konstantin Hebenstreit, Matthias Samwald.
NeurIPSW 2024. What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks. Nathalie Maria Kirch, Severin Field, Stephen Casper.