Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition
Abstract
Monocular depth estimation is a challenging problem on which deep neural networks have demonstrated great potential. However, depth maps predicted by existing deep models usually lack fine-grained details due to convolution operations and down-samplings in networks. We find that increasing input resolution is helpful to preserve more local details while the estimation at low resolution is more accurate globally. Therefore, we propose a novel depth map fusion module to combine the advantages of estimations with multi-resolution inputs. Instead of merging the low- and high-resolution estimations equally, we adopt the core idea of Poisson fusion, trying to implant the gradient domain of high-resolution depth into the low-resolution depth. While classic Poisson fusion requires a fusion mask as supervision, we propose a self-supervised framework based on guided image filtering. We demonstrate that this gradient-based composition performs much better at noisy immunity, compared with the state-of-the-art depth map fusion method. Our lightweight depth fusion is one-shot and runs in real-time, making it 80X faster than a state-of-the-art depth fusion method. Quantitative evaluations demonstrate that the proposed method can be integrated into many fully convolutional monocular depth estimation backbones with a significant performance boost, leading to state-of-the-art results of detail enhancement on depth maps. Codes are released at https://github.com/yuinsky/gradient-based-depth-map-fusion.
Cite
Text
Dai et al. "Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I1.25123Markdown
[Dai et al. "Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/dai2023aaai-multi/) doi:10.1609/AAAI.V37I1.25123BibTeX
@inproceedings{dai2023aaai-multi,
title = {{Multi-Resolution Monocular Depth mAP Fusion by Self-Supervised Gradient-Based Composition}},
author = {Dai, Yaqiao and Yi, Renjiao and Zhu, Chenyang and He, Hongjun and Xu, Kai},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2023},
pages = {488-496},
doi = {10.1609/AAAI.V37I1.25123},
url = {https://mlanthology.org/aaai/2023/dai2023aaai-multi/}
}