Reber, David

2 publications

ICML 2025 RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals David Reber, Sean M Richardson, Todd Nief, Cristina Garbacea, Victor Veitch
NeurIPSW 2023 What's Your Use Case? a Taxonomy of Causal Evaluations of Post-Hoc Interpretability David Reber, Cristina Garbacea, Victor Veitch