ML Anthology
Lermen, Simon
2 publications
NeurIPSW
2024
Applying Refusal-Vector Ablation to Llama 3.1 70b Agents
Simon Lermen, Mateusz Dziemian, Govind Pimpale
ICLRW
2024
LoRA Fine-Tuning Efficiently Undoes Safety Training in Llama 2-Chat 70b
Simon Lermen, Charlie Rogers-Smith