Goyal, Navita

1 publications

NeurIPS 2025 Causal Differentiating Concepts: Interpreting LM Behavior via Causal Representation Learning Navita Goyal, Hal Daumé Iii, Alexandre Drouin, Dhanya Sridhar