ML Anthology
Authors
Search
About
Williams, Kai
1 publications
NeurIPS
2024
Representation Noising: A Defence Mechanism Against Harmful Finetuning
Domenic Rosati
,
Jan Wehner
,
Kai Williams
,
Ćukasz Bartoszcze
,
David Atanasov
,
Robie Gonzales
,
Subhabrata Majumdar
,
Carsten Maple
,
Hassan Sajjad
,
Frank Rudzicz