Nief, Todd

2 publications

ICLR 2026 Dynamic Weight Grafting: Localizing Finetuned Factual Knowledge in Transformers Todd Nief, David Reber, Sean M. Richardson, Ari Holtzman
ICML 2025 RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals David Reber, Sean M Richardson, Todd Nief, Cristina Garbacea, Victor Veitch