Monocular Depth Estimation with Adaptive Geometric Attention

Naderi, Taher; Sadovnik, Amir; Hayward, Jason; Qi, Hairong

Monocular Depth Estimation with Adaptive Geometric Attention

Taher Naderi, Amir Sadovnik, Jason Hayward, Hairong Qi

WACV 2022 pp. 944-954

/wacv/2022/naderi2022wacv-monocular/

Abstract

Single image depth estimation is an ill-posed problem. That is, it is not mathematically possible to uniquely estimate the 3rd dimension (or depth) from a single 2D image. Hence, additional constraints need to be incorporated in order to regulate the solution space. In this paper, we explore the idea of constraining the model by taking advantage of the similarity between the RGB image and the corresponding depth map at the geometric edges of the 3D scene for more accurate depth estimation. We propose a general light-weight adaptive geometric attention module that uses the cross-correlation between the encoder and the decoder as a measure of this similarity. More precisely, we use the cosine similarity between the local embedded features in the encoder and the decoder at each spatial point. The proposed module along with the encoder-decoder network is trained in an end-to-end fashion and achieves superior and competitive performance in comparison with other state-of-the-art methods. In addition, adding our module to the base encoder-decoder model adds only an additional 0.03% (or 0.0003) parameters. Therefore, this module can be added to any base encoder-decoder network without changing its structure to address any task at hand.

PDF WACV Semantic Scholar

Cite

Text

Naderi et al. "Monocular Depth Estimation with Adaptive Geometric Attention." Winter Conference on Applications of Computer Vision, 2022.

Markdown

[Naderi et al. "Monocular Depth Estimation with Adaptive Geometric Attention." Winter Conference on Applications of Computer Vision, 2022.](https://mlanthology.org/wacv/2022/naderi2022wacv-monocular/)

BibTeX

@inproceedings{naderi2022wacv-monocular,
  title     = {{Monocular Depth Estimation with Adaptive Geometric Attention}},
  author    = {Naderi, Taher and Sadovnik, Amir and Hayward, Jason and Qi, Hairong},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2022},
  pages     = {944-954},
  url       = {https://mlanthology.org/wacv/2022/naderi2022wacv-monocular/}
}