NeSyFOLD: A Framework for Interpretable Image Classification

Abstract

Deep learning models such as CNNs have surpassed human performance in computer vision tasks such as image classi- fication. However, despite their sophistication, these models lack interpretability which can lead to biased outcomes re- flecting existing prejudices in the data. We aim to make pre- dictions made by a CNN interpretable. Hence, we present a novel framework called NeSyFOLD to create a neurosym- bolic (NeSy) model for image classification tasks. The model is a CNN with all layers following the last convolutional layer replaced by a stratified answer set program (ASP) derived from the last layer kernels. The answer set program can be viewed as a rule-set, wherein the truth value of each pred- icate depends on the activation of the corresponding kernel in the CNN. The rule-set serves as a global explanation for the model and is interpretable. We also use our NeSyFOLD framework with a CNN that is trained using a sparse kernel learning technique called Elite BackProp (EBP). This leads to a significant reduction in rule-set size without compromising accuracy or fidelity thus improving scalability of the NeSy model and interpretability of its rule-set. Evaluation is done on datasets with varied complexity and sizes. We also pro- pose a novel algorithm for labelling the predicates in the rule- set with meaningful semantic concept(s) learnt by the CNN. We evaluate the performance of our “semantic labelling algo- rithm” to quantify the efficacy of the semantic labelling for both the NeSy model and the NeSy-EBP model.

Cite

Text

Padalkar et al. "NeSyFOLD: A Framework for Interpretable Image Classification." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I5.28235

Markdown

[Padalkar et al. "NeSyFOLD: A Framework for Interpretable Image Classification." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/padalkar2024aaai-nesyfold/) doi:10.1609/AAAI.V38I5.28235

BibTeX

@inproceedings{padalkar2024aaai-nesyfold,
  title     = {{NeSyFOLD: A Framework for Interpretable Image Classification}},
  author    = {Padalkar, Parth and Wang, Huaduo and Gupta, Gopal},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {4378-4387},
  doi       = {10.1609/AAAI.V38I5.28235},
  url       = {https://mlanthology.org/aaai/2024/padalkar2024aaai-nesyfold/}
}