Srinivas, Suraj
22 publications
NeurIPS
2023
Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability
NeurIPS
2023
Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness
ICMLW
2023
Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness