Lin, Leon

1 publication

AAAI 2025. Single Character Perturbations Break LLM Alignment. Leon Lin, Hannah Brown, Kenji Kawaguchi, Michael Shieh.