ML Anthology
de Lucena, Diogo Schwerz
1 publication
ICLRW 2024
Rethinking Harmless Refusals When Fine-Tuning Foundation Models
Florin Pop, Judd Rosenblatt, Diogo Schwerz de Lucena, Michael Vaiana