Satyanarayan, Arvind

3 publications

ECCVW 2024. Explanation Alignment: Quantifying the Correctness of Model Reasoning at Scale. Hyemin Bang, Angie W. Boggust, Arvind Satyanarayan
AAAI 2022. Teaching Humans When to Defer to a Classifier via Exemplars. Hussein Mozannar, Arvind Satyanarayan, David A. Sontag
Distill 2018. The Building Blocks of Interpretability. Chris Olah, Arvind Satyanarayan, Ian Johnson, Shan Carter, Ludwig Schubert, Katherine Ye, Alexander Mordvintsev