ML Anthology
Authors
Search
About
Nief, Todd
1 publications
ICML
2025
RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals
David Reber
,
Sean M Richardson
,
Todd Nief
,
Cristina Garbacea
,
Victor Veitch