ML Anthology
Authors
Search
About
Reuel, Ann-Katrin
3 publications
NeurIPSW
2023
Analyzing and Editing Inner Mechanisms of Backdoored Language Models
Max Lamparth
,
Ann-Katrin Reuel
NeurIPSW
2023
Assessing Risks of Using Autonomous Language Models in Military and Diplomatic Planning
Gabriel Mukobi
,
Ann-Katrin Reuel
,
Juan-Pablo Rivera
,
Chandler Smith
AAAI
2022
Using Adaptive Stress Testing to Identify Paths to Ethical Dilemmas in Autonomous Systems
Ann-Katrin Reuel
,
Mark Koren
,
Anthony Corso
,
Mykel J. Kochenderfer