ML Anthology
Authors
Search
About
Riccardi, Annalisa
1 publications
TMLR
2025
Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models
Paul Darm
,
Annalisa Riccardi