ML Anthology
Kirch, Nathalie Maria
2 publications
NeurIPSW 2024
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty Simulations
Nathalie Maria Kirch, Konstantin Hebenstreit, Matthias Samwald
NeurIPSW 2024
What Features in Prompts Jailbreak LLMs? Investigating the Mechanisms Behind Attacks
Nathalie Maria Kirch, Severin Field, Stephen Casper