Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition

Dai, Yaqiao; Yi, Renjiao; Zhu, Chenyang; He, Hongjun; Xu, Kai

doi:10.1609/AAAI.V37I1.25123

Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition

Yaqiao Dai, Renjiao Yi, Chenyang Zhu, Hongjun He, Kai Xu

AAAI 2023 pp. 488-496

doi:10.1609/AAAI.V37I1.25123 /aaai/2023/dai2023aaai-multi/

Abstract

Monocular depth estimation is a challenging problem on which deep neural networks have demonstrated great potential. However, depth maps predicted by existing deep models usually lack fine-grained details due to convolution operations and down-samplings in networks. We find that increasing input resolution is helpful to preserve more local details while the estimation at low resolution is more accurate globally. Therefore, we propose a novel depth map fusion module to combine the advantages of estimations with multi-resolution inputs. Instead of merging the low- and high-resolution estimations equally, we adopt the core idea of Poisson fusion, trying to implant the gradient domain of high-resolution depth into the low-resolution depth. While classic Poisson fusion requires a fusion mask as supervision, we propose a self-supervised framework based on guided image filtering. We demonstrate that this gradient-based composition performs much better at noisy immunity, compared with the state-of-the-art depth map fusion method. Our lightweight depth fusion is one-shot and runs in real-time, making it 80X faster than a state-of-the-art depth fusion method. Quantitative evaluations demonstrate that the proposed method can be integrated into many fully convolutional monocular depth estimation backbones with a significant performance boost, leading to state-of-the-art results of detail enhancement on depth maps. Codes are released at https://github.com/yuinsky/gradient-based-depth-map-fusion.

PDF AAAI Semantic Scholar

Cite

Text

Dai et al. "Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I1.25123

Markdown

[Dai et al. "Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/dai2023aaai-multi/) doi:10.1609/AAAI.V37I1.25123

BibTeX

@inproceedings{dai2023aaai-multi,
  title     = {{Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition}},
  author    = {Dai, Yaqiao and Yi, Renjiao and Zhu, Chenyang and He, Hongjun and Xu, Kai},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {488-496},
  doi       = {10.1609/AAAI.V37I1.25123},
  url       = {https://mlanthology.org/aaai/2023/dai2023aaai-multi/}
}