Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution

Abstract

Deep learning-based super-resolution (SR) has gained tremendous popularity in recent years because of its high image quality performance and wide application scenarios. However, prior methods typically suffer from large amounts of computations and huge power consumption, causing difficulties for real-time inference, especially on resourcelimited platforms such as mobile devices. To mitigate this, we propose a compiler-aware SR neural architecture search (NAS) framework that conducts depth search and per-layer width search with adaptive SR blocks. The inference speed is directly taken into the optimization along with the SR loss to derive SR models with high image quality while satisfying the real-time inference requirement. Instead of measuring the speed on mobile devices at each iteration during the search process, a speed model incorporated with compiler optimizations is leveraged to predict the inference latency of the SR block with various width configurations for faster convergence. With the proposed framework, we achieve realtime SR inference for implementing 720p resolution with competitive SR performance (in terms of PSNR and SSIM) on GPU/DSP of mobile platforms (Samsung Galaxy S21).

Cite

Text

Wu et al. "Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19800-7_6

Markdown

[Wu et al. "Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/wu2022eccv-compileraware/) doi:10.1007/978-3-031-19800-7_6

BibTeX

@inproceedings{wu2022eccv-compileraware,
  title     = {{Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution}},
  author    = {Wu, Yushu and Gong, Yifan and Zhao, Pu and Li, Yanyu and Zhan, Zheng and Niu, Wei and Tang, Hao and Qin, Minghai and Ren, Bin and Wang, Yanzhi},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19800-7_6},
  url       = {https://mlanthology.org/eccv/2022/wu2022eccv-compileraware/}
}