ML Anthology
Reber, David
3 publications
ICLR 2026
Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers
Todd Nief, David Reber, Sean M. Richardson, Ari Holtzman
ICML 2025
RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals
David Reber, Sean M Richardson, Todd Nief, Cristina Garbacea, Victor Veitch
NeurIPSW 2023
What's Your Use Case? A Taxonomy of Causal Evaluations of Post-Hoc Interpretability
David Reber, Cristina Garbacea, Victor Veitch