ML Anthology
Authors
Search
About
Bartoszcze, Łukasz
1 publications
NeurIPS
2024
Representation Noising: A Defence Mechanism Against Harmful Finetuning
Domenic Rosati
,
Jan Wehner
,
Kai Williams
,
Łukasz Bartoszcze
,
David Atanasov
,
Robie Gonzales
,
Subhabrata Majumdar
,
Carsten Maple
,
Hassan Sajjad
,
Frank Rudzicz