ML Anthology
Syed, Aaquib
5 publications
ICML
2025
Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization
Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite
NeurIPS
2024
Refusal in Language Models Is Mediated by a Single Direction
Andy Arditi, Oscar Obeso, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda
ICMLW
2024
Refusal in Language Models Is Mediated by a Single Direction
Andy Arditi, Oscar Balcells Obeso, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda
ICMLW
2024
Robust Knowledge Unlearning via Mechanistic Localizations
Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite
ICMLW
2024
Robust Unlearning via Mechanistic Localizations
Phillip Huang Guo, Aaquib Syed, Abhay Sheshadri, Aidan Ewart, Gintare Karolina Dziugaite