ML Anthology
Authors
Search
About
Abdullah, Amir
2 publications
ICML
2025
Activation Space Interventions Can Be Transferred Between Large Language Models
Narmeen Fatimah Oozeer
,
Dhruv Nathawani
,
Nirmalendu Prakash
,
Michael Lan
,
Abir Harrasse
,
Amir Abdullah
NeurIPS
2024
Interpreting Learned Feedback Patterns in Large Language Models
Luke Marks
,
Amir Abdullah
,
Clement Neo
,
Rauno Arike
,
David Krueger
,
Philip Torr
,
Fazl Barez