CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines

Abstract

We present CoCoX (short for Conceptual and Counterfactual Explanations), a model for explaining decisions made by a deep convolutional neural network (CNN). In Cognitive Psychology, the factors (or semantic-level features) that humans zoom in on when they imagine an alternative to a model prediction are often referred to as fault-lines. Motivated by this, our CoCoX model explains decisions made by a CNN using fault-lines. Specifically, given an input image I for which a CNN classification model M predicts class cpred, our fault-line based explanation identifies the minimal semantic-level features (e.g., stripes on zebra, pointed ears of dog), referred to as explainable concepts, that need to be added to or deleted from I in order to alter the classification category of I by M to another specified class calt. We argue that, due to the conceptual and counterfactual nature of fault-lines, our CoCoX explanations are practical and more natural for both expert and non-expert users to understand the internal workings of complex deep learning models. Extensive quantitative and qualitative experiments verify our hypotheses, showing that CoCoX significantly outperforms the state-of-the-art explainable AI models. Our implementation is available at https://github.com/arjunakula/CoCoX

Cite

Text

Akula et al. "CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I03.5643

Markdown

[Akula et al. "CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/akula2020aaai-cocox/) doi:10.1609/AAAI.V34I03.5643

BibTeX

@inproceedings{akula2020aaai-cocox,
  title     = {{CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines}},
  author    = {Akula, Arjun R. and Wang, Shuai and Zhu, Song-Chun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {2594-2601},
  doi       = {10.1609/AAAI.V34I03.5643},
  url       = {https://mlanthology.org/aaai/2020/akula2020aaai-cocox/}
}