Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation
Abstract
Semantics-guided self-supervised monocular depth estimation has been widely researched, owing to the strong cross-task correlation of depth and semantics. However, since depth estimation and semantic segmentation are fundamentally two types of tasks: one is regression while the other is classification, the distribution of depth feature and semantic feature are naturally different. Previous works that leverage semantic information in depth estimation mostly neglect such representational discrimination, which leads to insufficient representation enhancement of depth feature. In this work, we propose an attention-based module to enhance task-specific feature by addressing their feature uniqueness within instances. Additionally, we propose a metric learning based approach to accomplish comprehensive enhancement on depth feature by creating a separation between instances in feature space. Extensive experiments and analysis demonstrate the effectiveness of our proposed method. In the end, our method achieves the state-of-the-art performance on KITTI dataset.
Cite
Text
Ma et al. "Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19769-7_18Markdown
[Ma et al. "Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/ma2022eccv-comprehensive/) doi:10.1007/978-3-031-19769-7_18BibTeX
@inproceedings{ma2022eccv-comprehensive,
title = {{Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation}},
author = {Ma, Jingyuan and Lei, Xiangyu and Liu, Nan and Zhao, Xian and Pu, Shiliang},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-19769-7_18},
url = {https://mlanthology.org/eccv/2022/ma2022eccv-comprehensive/}
}