Riccardi, Annalisa

1 publications

TMLR 2025 Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models Paul Darm, Annalisa Riccardi