ML Anthology
Authors
Search
About
Marks, Luke
2 publications
ICLR
2026
Output Supervision Can Obfuscate the Chain of Thought
Jacob Drori
,
Luke Marks
,
Bryce Woodworth
,
Alex Cloud
,
Alexander Matt Turner
NeurIPS
2024
Interpreting Learned Feedback Patterns in Large Language Models
Luke Marks
,
Amir Abdullah
,
Clement Neo
,
Rauno Arike
,
David Krueger
,
Philip Torr
,
Fazl Barez