Cohen, Jonathan D.
19 publications
NeurIPS
2025
Causal Head Gating: A Framework for Interpreting Roles of Attention Heads in Transformers
NeurIPS
2024
Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem
NeurIPS
2022
Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines