Tagade, Arush

1 publication

NeurIPSW 2023. Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation. Rusheb Shah, Quentin Feuillade Montixi, Soroush Pour, Arush Tagade, Javier Rando