ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration
Abstract
Brain-inspired computer architecture facilitates low-power, low-latency deep neural network inference for embedded AI applications. The hardware performance crucially hinges on the quantity of non-zero activations (i.e., events) during inference. Thus, we propose a novel event suppression method, dubbed ELSE, which enhances inference Efficiency via Line-based Sparsity Exploration. Specifically, it exploits spatial correlation between adjacent lines in activation maps to reduce network events. ELSE reduces event-triggered computations by 3.14∼6.49× for object detection and by 2.43∼5.75× for pose estimation across various network architectures compared to conventional processing. Additionally, we show that combining ELSE with other event suppression methods can either significantly enhance computation savings for spatial suppression or reduce state memory footprint by > 2× for temporal suppression. The latter alleviates the challenge of temporal execution exceeding the resource constraints of real-world embedded platforms. These results highlight ELSE’s significant event suppression ability and its capacity to deliver complementary performance enhancements for SOTA methods.
Cite
Text
Zhu et al. "ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73247-8_24Markdown
[Zhu et al. "ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhu2024eccv-else/) doi:10.1007/978-3-031-73247-8_24BibTeX
@inproceedings{zhu2024eccv-else,
title = {{ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration}},
author = {Zhu, Zeqi and Garcia-Ortiz, Alberto and Waeijen, Luc and Bondarev, Egor and Pourtaherian, Arash and Moreira, Orlando},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-73247-8_24},
url = {https://mlanthology.org/eccv/2024/zhu2024eccv-else/}
}