ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration

Abstract

Brain-inspired computer architecture facilitates low-power, low-latency deep neural network inference for embedded AI applications. The hardware performance crucially hinges on the quantity of non-zero activations (i.e., events) during inference. Thus, we propose a novel event suppression method, dubbed ELSE, which enhances inference Efficiency via Line-based Sparsity Exploration. Specifically, it exploits spatial correlation between adjacent lines in activation maps to reduce network events. ELSE reduces event-triggered computations by 3.14∼6.49× for object detection and by 2.43∼5.75× for pose estimation across various network architectures compared to conventional processing. Additionally, we show that combining ELSE with other event suppression methods can either significantly enhance computation savings for spatial suppression or reduce state memory footprint by > 2× for temporal suppression. The latter alleviates the challenge of temporal execution exceeding the resource constraints of real-world embedded platforms. These results highlight ELSE’s significant event suppression ability and its capacity to deliver complementary performance enhancements for SOTA methods.

Cite

Text

Zhu et al. "ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73247-8_24

Markdown

[Zhu et al. "ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/zhu2024eccv-else/) doi:10.1007/978-3-031-73247-8_24

BibTeX

@inproceedings{zhu2024eccv-else,
  title     = {{ELSE: Efficient Deep Neural Network Inference Through Line-Based Sparsity Exploration}},
  author    = {Zhu, Zeqi and Garcia-Ortiz, Alberto and Waeijen, Luc and Bondarev, Egor and Pourtaherian, Arash and Moreira, Orlando},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73247-8_24},
  url       = {https://mlanthology.org/eccv/2024/zhu2024eccv-else/}
}