CATS: Combined Activation and Temporal Suppression for Efficient Network Inference
Abstract
Brain-inspired event-driven processors execute deep neural networks (DNNs) in an event sparsity-aware manner, leading to superior performance compared to conventional platforms. In the pursuit of higher event sparsity, prior studies suppress non-zero events by either eliminating the intra-frame activations (spatially) or leveraging the redundancy in the inter-frame differences for a video (temporally). However, we have empirically observed that simultaneously enhancing activation and temporal sparsity can lead to a synergistic suppression outcome. To this end, we propose an end-to-end event suppression training approach CATS -- Combined Activation and Temporal Suppression for efficient network inference. It utilizes a gradient-based method to search for the optimal temporal thresholds for the network while penalizing the presence of non-zero events in spatial and temporal domains simultaneously. We demonstrate that CATS achieves 2 6 times more event suppression compared to the inherent ReLU suppression, consistently outperforming the SOTA by a significant margin at various accuracy levels. Extensive experimental results show that CATS also generalizes to multiple tasks -- object detection, object tracking, pose estimation, and semantic segmentation. Furthermore, a case study for the commercial event-driven processor GrAI-VIP highlights that the induced event sparsity in SSD on EgoHands datasets efficiently translates into significant improvements of 2.5 x in FPS, 2.1 x in latency, and 3.8 x in energy consumption, while maintaining the model accuracy.
Cite
Text
Zhu et al. "CATS: Combined Activation and Temporal Suppression for Efficient Network Inference." Winter Conference on Applications of Computer Vision, 2024.Markdown
[Zhu et al. "CATS: Combined Activation and Temporal Suppression for Efficient Network Inference." Winter Conference on Applications of Computer Vision, 2024.](https://mlanthology.org/wacv/2024/zhu2024wacv-cats/)BibTeX
@inproceedings{zhu2024wacv-cats,
title = {{CATS: Combined Activation and Temporal Suppression for Efficient Network Inference}},
author = {Zhu, Zeqi and Pourtaherian, Arash and Waeijen, Luc and Akkaya, Ibrahim Batuhan and Bondarev, Egor and Moreira, Orlando},
booktitle = {Winter Conference on Applications of Computer Vision},
year = {2024},
pages = {8166-8175},
url = {https://mlanthology.org/wacv/2024/zhu2024wacv-cats/}
}