Multi-Scale Attention-Based Inclination Angles Estimation for Panoramic Camera

Abstract

Images taken by panoramic cameras in the upright posture can give viewers a better sense and make the downstream panoramic image-based computer vision tasks easier. To estimate the inclination angles of panoramic camera, we proposed a simple but elegant panoramic image-based network, which combines the advantages of geometry-based and deep-learning-based methods. First, a backbone network with five down-sampling layers is designed to focus on the local distortion features. Then, since non-upright panoramic images have highly uniform geometric distortion for the same camera inclination angles, a multi-scale attention module is proposed for the first time, which can weigh each pixel on the feature maps of the backbone network and allows the network to focus on the global and shallow geometric features. Moreover, apart from angle loss, pixel-level image loss is introduced in our network for the inclination angles estimation task to allow the network to compensate for pixel deviations during training. The experiments show that our method overcomes other leading state-of-the-art methods in this field.

Cite

Text

Shan et al. "Multi-Scale Attention-Based Inclination Angles Estimation for Panoramic Camera." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00139

Markdown

[Shan et al. "Multi-Scale Attention-Based Inclination Angles Estimation for Panoramic Camera." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/shan2024cvprw-multiscale/) doi:10.1109/CVPRW63382.2024.00139

BibTeX

@inproceedings{shan2024cvprw-multiscale,
  title     = {{Multi-Scale Attention-Based Inclination Angles Estimation for Panoramic Camera}},
  author    = {Shan, Yuhao and Chen, Heyu and Zhang, Jiaying and Li, Shigang and Li, Jianfeng},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {1322-1330},
  doi       = {10.1109/CVPRW63382.2024.00139},
  url       = {https://mlanthology.org/cvprw/2024/shan2024cvprw-multiscale/}
}