Interpretable Object Recognition by Semantic Prototype Analysis

Abstract

People can usually give reasons for recognizing a particular object as a specific category, using various means such as body language (by pointing out) and natural language (by telling). This inspires us to develop a recognition model with such principles to explain the recognition process to enhance human trust. We propose Semantic Prototype Analysis Network (SPANet), an interpretable object recognition approach that enables models to explicate the decision process more lucidly and comprehensibly to humans by "pointing out where to focus" and "telling about why it is" simultaneously. With the proposed method, some part prototypes with semantic concepts will be provided to elaborate on the classification together with a group of visualized samples to achieve both part-wise and semantic interpretability. The results of extensive experiments demonstrate that SPANet is able to recognize objects almost as well as the non-interpretable models, at the same time generating intelligible explanations for its decision process.

Cite

Text

Wan et al. "Interpretable Object Recognition by Semantic Prototype Analysis." Winter Conference on Applications of Computer Vision, 2024.

Markdown

[Wan et al. "Interpretable Object Recognition by Semantic Prototype Analysis." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/wan2024wacv-interpretable/)

BibTeX

@inproceedings{wan2024wacv-interpretable,
  title     = {{Interpretable Object Recognition by Semantic Prototype Analysis}},
  author    = {Wan, Qiyang and Wang, Ruiping and Chen, Xilin},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2024},
  pages     = {800-809},
  url       = {https://mlanthology.org/wacv/2024/wan2024wacv-interpretable/}
}