CSAnet: High Speed Channel Spatial Attention Network for Mobile ISP

Abstract

The Image Signal Processor (ISP) is a customized device to restore RGB images from the pixel signals of CMOS image sensor. In order to realize this function, a series of processing units are leveraged to tackle different artifacts, such as color shifts, signal noise, moire effects, and so on, that are introduced from the photo-capturing devices. However, tuning each processing unit is highly complicated and requires a lot of experience and effort from image experts. In this paper, a novel network architecture, CSANet, with emphases on inference speed and high PSNR is proposed for end-to-end learned ISP task. The proposed CSANet applies a double attention module employing both channel and spatial attentions. Particularly, its spatial attention is simplified to a light-weighted dilated depth-wise convolution and still performs as well as others. As proof of performance, CSANet won 2nd place in the Mobile AI 2021 Learned Smartphone ISP Challenge with 1st place PSNR score.

Cite

Text

Hsyu et al. "CSAnet: High Speed Channel Spatial Attention Network for Mobile ISP." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021. doi:10.1109/CVPRW53098.2021.00282

Markdown

[Hsyu et al. "CSAnet: High Speed Channel Spatial Attention Network for Mobile ISP." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2021.](https://mlanthology.org/cvprw/2021/hsyu2021cvprw-csanet/) doi:10.1109/CVPRW53098.2021.00282

BibTeX

@inproceedings{hsyu2021cvprw-csanet,
  title     = {{CSAnet: High Speed Channel Spatial Attention Network for Mobile ISP}},
  author    = {Hsyu, Ming-Chun and Liu, Chih-Wei and Chen, Chao-Hung and Chen, Chao-Wei and Tsai, Wen-Chia},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2021},
  pages     = {2486-2493},
  doi       = {10.1109/CVPRW53098.2021.00282},
  url       = {https://mlanthology.org/cvprw/2021/hsyu2021cvprw-csanet/}
}