Graux, Romain

1 publications

NeurIPSW 2023 Second-Order Jailbreaks: Generative Agents Successfully Manipulate Through an Intermediary Mikhail Terekhov, Romain Graux, Eduardo Neville, Denis Rosset, Gabin Kolly