ML Anthology
Rosset, Denis
1 publication
NeurIPSW 2023
Second-Order Jailbreaks: Generative Agents Successfully Manipulate Through an Intermediary
Mikhail Terekhov, Romain Graux, Eduardo Neville, Denis Rosset, Gabin Kolly