ML Anthology
Authors
Search
About
Christiani, Marco
1 publications
ICLR
2025
Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing
Keltin Grimes
,
Marco Christiani
,
David Shriver
,
Marissa Catherine Connor