Aligning Artificial Neural Networks and Ontologies Towards Explainable AI

Abstract

Neural networks have been the key to solve a variety of different problems. However, neural network models are still regarded as black boxes, since they do not provide any human-interpretable evidence as to why they output a certain result. We address this issue by leveraging on ontologies and building small classifiers that map a neural network model's internal state to concepts from an ontology, enabling the generation of symbolic justifications for the output of neural network models. Using an image classification problem as testing ground, we discuss how to map the internal state of a neural network to the concepts of an ontology, examine whether the results obtained by the established mappings match our understanding of the mapped concepts, and analyze the justifications obtained through this method.

Cite

Text

de Sousa Ribeiro and Leite. "Aligning Artificial Neural Networks and Ontologies Towards Explainable AI." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I6.16626

Markdown

[de Sousa Ribeiro and Leite. "Aligning Artificial Neural Networks and Ontologies Towards Explainable AI." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/desousaribeiro2021aaai-aligning/) doi:10.1609/AAAI.V35I6.16626

BibTeX

@inproceedings{desousaribeiro2021aaai-aligning,
  title     = {{Aligning Artificial Neural Networks and Ontologies Towards Explainable AI}},
  author    = {de Sousa Ribeiro, Manuel and Leite, João},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {4932-4940},
  doi       = {10.1609/AAAI.V35I6.16626},
  url       = {https://mlanthology.org/aaai/2021/desousaribeiro2021aaai-aligning/}
}