Nief, Todd

1 publications

ICML 2025 RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals David Reber, Sean M Richardson, Todd Nief, Cristina Garbacea, Victor Veitch