ML Anthology
Tagade, Arush
1 publication
NeurIPSW 2023
Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation
Rusheb Shah, Quentin Feuillade Montixi, Soroush Pour, Arush Tagade, Javier Rando