Efficient Decompositional Rule Extraction for Deep Neural Networks
Abstract
In recent years, there has been significant work on increasing both interpretability and debuggability of a Deep Neural Network (DNN) by extracting a rule-based model that approximates its decision boundary. Nevertheless, current DNN rule extraction methods that consider a DNN’s latent space when extracting rules, known as decompositional algorithms, are either restricted to single-layer DNNs or intractable as the size of the DNN or data grows. In this paper, we address these limitations by introducing ECLAIRE, a novel polynomial-time rule extraction algorithm capable of scaling to both large DNN architectures and large training datasets. We evaluate ECLAIRE on a wide variety of tasks, ranging from breast cancer prognosis to particle detection, and show that it consistently extracts more accurate and comprehensible rule sets than the current state-of-the-art methods while using orders of magnitude less computational resources. We make all of our methods available, including a rule set visualisation interface, through the open-source REMIX library (https://github.com/mateoespinosa/remix).
Cite
Text
Zarlenga et al. "Efficient Decompositional Rule Extraction for Deep Neural Networks." NeurIPS 2021 Workshops: XAI4Debugging, 2021.Markdown
[Zarlenga et al. "Efficient Decompositional Rule Extraction for Deep Neural Networks." NeurIPS 2021 Workshops: XAI4Debugging, 2021.](https://mlanthology.org/neuripsw/2021/zarlenga2021neuripsw-efficient/)BibTeX
@inproceedings{zarlenga2021neuripsw-efficient,
title = {{Efficient Decompositional Rule Extraction for Deep Neural Networks}},
author = {Zarlenga, Mateo Espinosa and Shams, Zohreh and Jamnik, Mateja},
booktitle = {NeurIPS 2021 Workshops: XAI4Debugging},
year = {2021},
url = {https://mlanthology.org/neuripsw/2021/zarlenga2021neuripsw-efficient/}
}