Convolutional Dynamic Alignment Networks for Interpretable Classifications

Abstract

We introduce a new family of neural network models called Convolutional Dynamic Alignment Networks (CoDA-Nets), which are performant classifiers with a high degree of inherent interpretability. Their core building blocks are Dynamic Alignment Units (DAUs), which linearly transform their input with weight vectors that dynamically align with task-relevant patterns. As a result, CoDA-Nets model the classification prediction through a series of input-dependent linear transformations, allowing for linear decomposition of the output into individual input contributions. Given the alignment of the DAUs, the resulting contribution maps align with discriminative input patterns. These model-inherent decompositions are of high visual quality and outperform existing attribution methods under quantitative metrics. Further, CoDA-Nets constitute performant classifiers, achieving on par results to ResNet and VGG models on e.g. CIFAR-10 and TinyImagenet.

Cite

Text

Bohle et al. "Convolutional Dynamic Alignment Networks for Interpretable Classifications." Conference on Computer Vision and Pattern Recognition, 2021. doi:10.1109/CVPR46437.2021.00990

Markdown

[Bohle et al. "Convolutional Dynamic Alignment Networks for Interpretable Classifications." Conference on Computer Vision and Pattern Recognition, 2021.](https://mlanthology.org/cvpr/2021/bohle2021cvpr-convolutional/) doi:10.1109/CVPR46437.2021.00990

BibTeX

@inproceedings{bohle2021cvpr-convolutional,
  title     = {{Convolutional Dynamic Alignment Networks for Interpretable Classifications}},
  author    = {Bohle, Moritz and Fritz, Mario and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2021},
  pages     = {10029-10038},
  doi       = {10.1109/CVPR46437.2021.00990},
  url       = {https://mlanthology.org/cvpr/2021/bohle2021cvpr-convolutional/}
}