Streaming Weak Submodularity: Interpreting Neural Networks on the Fly
Abstract
In many machine learning applications, it is important to explain the predictions of a black-box classifier. For example, why does a deep neural network assign an image to a particular class? We cast interpretability of black-box classifiers as a combinatorial maximization problem and propose an efficient streaming algorithm to solve it subject to cardinality constraints. By extending ideas from Badanidiyuru et al. [2014], we provide a constant factor approximation guarantee for our algorithm in the case of random stream order and a weakly submodular objective function. This is the first such theoretical guarantee for this general class of functions, and we also show that no such algorithm exists for a worst case stream order. Our algorithm obtains similar explanations of Inception V3 predictions 10 times faster than the state-of-the-art LIME framework of Ribeiro et al. [2016].
Cite
Text
Elenberg et al. "Streaming Weak Submodularity: Interpreting Neural Networks on the Fly." Neural Information Processing Systems, 2017.Markdown
[Elenberg et al. "Streaming Weak Submodularity: Interpreting Neural Networks on the Fly." Neural Information Processing Systems, 2017.](https://mlanthology.org/neurips/2017/elenberg2017neurips-streaming/)BibTeX
@inproceedings{elenberg2017neurips-streaming,
title = {{Streaming Weak Submodularity: Interpreting Neural Networks on the Fly}},
author = {Elenberg, Ethan and Dimakis, Alexandros G and Feldman, Moran and Karbasi, Amin},
booktitle = {Neural Information Processing Systems},
year = {2017},
pages = {4044-4054},
url = {https://mlanthology.org/neurips/2017/elenberg2017neurips-streaming/}
}