Lermen, Simon

2 publications

NeurIPSW 2024 Applying Refusal-Vector Ablation to Llama 3.1 70b Agents Simon Lermen, Mateusz Dziemian, Govind Pimpale
ICLRW 2024 LoRA Fine-Tuning Efficiently Undoes Safety Training in Llama 2-Chat 70b Simon Lermen, Charlie Rogers-Smith