Christiani, Marco

1 publications

ICLR 2025 Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing Keltin Grimes, Marco Christiani, David Shriver, Marissa Catherine Connor