ML Anthology
Darm, Paul
1 publication
TMLR
2025
Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models
Paul Darm, Annalisa Riccardi