ML Anthology
Authors
Search
About
Goyal, Navita
1 publications
NeurIPS
2025
Causal Differentiating Concepts: Interpreting LM Behavior via Causal Representation Learning
Navita Goyal
,
Hal Daumé Iii
,
Alexandre Drouin
,
Dhanya Sridhar