Arai, Masaki

1 publications

ACML 2025 Jailbreak Defense in LLM via Attention Head Analysis and Selective Intervention Masaki Arai, Toshiki Shibahara, Daiki Chiba, Mitsuaki Akiyama, Masato Uchida