Turner, Alexander Matt

3 publications

ICLR 2026 Output Supervision Can Obfuscate the Chain of Thought Jacob Drori, Luke Marks, Bryce Woodworth, Alex Cloud, Alexander Matt Turner
NeurIPS 2025 Distillation Robustifies Unlearning Bruce W. Lee, Addie Foote, Alex Infanger, Leni Shor, Harish K Kamath, Jacob Goldman-Wetzler, Bryce Woodworth, Alex Cloud, Alexander Matt Turner
NeurIPSW 2022 Formalizing the Problem of Side Effect Regularization Alexander Matt Turner, Aseem Saxena, Prasad Tadepalli