ML Anthology
Authors
Search
About
Alex, Neel
3 publications
ICLR
2025
Protecting Against Simultaneous Data Poisoning Attacks
Neel Alex
,
Shoaib Ahmed Siddiqui
,
Amartya Sanyal
,
David Krueger
NeurIPSW
2023
Detecting Backdoors with Meta-Models
Lauro Langosco
,
Neel Alex
,
William Baker
,
David Quarel
,
Herbie Bradley
,
David Krueger
NeurIPSW
2023
Goal Misgeneralization as Implicit Goal Conditioning
Diego Dorn
,
Neel Alex
,
David Krueger