Abdullah, Amir

2 publications

ICML 2025 Activation Space Interventions Can Be Transferred Between Large Language Models Narmeen Fatimah Oozeer, Dhruv Nathawani, Nirmalendu Prakash, Michael Lan, Abir Harrasse, Amir Abdullah
NeurIPS 2024 Interpreting Learned Feedback Patterns in Large Language Models Luke Marks, Amir Abdullah, Clement Neo, Rauno Arike, David Krueger, Philip Torr, Fazl Barez