LANA: Latency Aware Network Acceleration
Abstract
We introduce latency-aware network acceleration (LANA), an approach that builds on neural architecture search (NAS) techniques to accelerate neural networks. LANA consists of two phases. In the first phase, it trains many alternative operations for every layer of a target network using layer-wise feature map distillation. In the second phase, it solves the combinatorial selection of efficient operations using a novel constrained integer linear optimization (ILP) approach. ILP brings unique properties: it (i) performs NAS within a few seconds to minutes, (ii) easily satisfies budget constraints, (iii) works at layer granularity, and (iv) supports a huge search space of O(10^100), surpassing prior search approaches in efficacy and efficiency. In extensive experiments, we show that LANA yields efficient and accurate models constrained by a target latency budget, while being significantly faster than other techniques. We analyze three popular network architectures, EfficientNetV1, EfficientNetV2, and ResNeSt, and achieve accuracy improvements (up to 3.0%) for all models when compressing larger models. LANA achieves significant speed-ups (up to 5x) with minor to no accuracy drop on GPU and CPU.
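The second phase can be read as a selection problem: pick exactly one candidate operation per layer so that a quality penalty is minimized while total latency stays within a budget. The sketch below illustrates only that problem statement on a toy instance. It is not the authors' implementation: it replaces the ILP solver with exhaustive enumeration (feasible only for tiny search spaces), it assumes the per-layer penalties are additive, and every name and number in it (`CANDIDATES`, `select_operations`, the error and latency values) is hypothetical.

```python
from itertools import product

# Hypothetical per-layer candidates: (name, distillation_error, latency_ms).
# All values are invented for illustration; none come from the paper.
CANDIDATES = [
    [("teacher", 0.00, 4.0), ("cheap_conv", 0.10, 2.0), ("identity", 0.50, 0.0)],
    [("teacher", 0.00, 6.0), ("cheap_conv", 0.20, 3.0), ("identity", 0.90, 0.0)],
    [("teacher", 0.00, 5.0), ("cheap_conv", 0.05, 2.5), ("identity", 0.40, 0.0)],
]


def select_operations(candidates, latency_budget):
    """Choose one operation per layer, minimizing summed error under a latency budget.

    Brute-force stand-in for an ILP solver: it enumerates every combination,
    so it only scales to toy problems. Returns (error, latency, names), or
    None when no combination fits the budget.
    """
    best = None
    for choice in product(*(range(len(layer)) for layer in candidates)):
        picked = [candidates[i][j] for i, j in enumerate(choice)]
        latency = sum(op[2] for op in picked)
        if latency > latency_budget:
            continue  # violates the latency constraint
        error = sum(op[1] for op in picked)  # assumed additive across layers
        if best is None or error < best[0]:
            best = (error, latency, [op[0] for op in picked])
    return best


if __name__ == "__main__":
    # The unmodified network costs 15 ms here; ask for at most 10 ms.
    print(select_operations(CANDIDATES, latency_budget=10.0))
```

In a real setting the enumeration would be replaced by an integer programming solver, since a space on the order of 10^100 combinations cannot be enumerated; the one-operation-per-layer constraint and the latency constraint would carry over as linear constraints on binary selection variables.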
Cite
Text
Molchanov et al. "LANA: Latency Aware Network Acceleration." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19775-8_9
Markdown
[Molchanov et al. "LANA: Latency Aware Network Acceleration." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/molchanov2022eccv-lana/) doi:10.1007/978-3-031-19775-8_9
BibTeX
@inproceedings{molchanov2022eccv-lana,
title = {{LANA: Latency Aware Network Acceleration}},
author = {Molchanov, Pavlo and Hall, Jimmy and Yin, Hongxu and Kautz, Jan and Fusi, Nicolo and Vahdat, Arash},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19775-8_9},
url = {https://mlanthology.org/eccv/2022/molchanov2022eccv-lana/}
}