Alex, Neel

3 publications

ICLR 2025. Protecting Against Simultaneous Data Poisoning Attacks. Neel Alex, Shoaib Ahmed Siddiqui, Amartya Sanyal, David Krueger.
NeurIPSW 2023. Detecting Backdoors with Meta-Models. Lauro Langosco, Neel Alex, William Baker, David Quarel, Herbie Bradley, David Krueger.
NeurIPSW 2023. Goal Misgeneralization as Implicit Goal Conditioning. Diego Dorn, Neel Alex, David Krueger.