Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention
Abstract
We propose a deep convolutional neural network (CNN) to estimate surface normals from a single color image accompanied by a low-quality depth channel. Unlike most previous works, we predict the normal on the 2-sphere rather than in 3D Euclidean space, which produces naturally normalized values and makes training stable. Although depth information is beneficial for normal estimation, the raw data contain missing values and noise. To alleviate this problem, we employ a confidence guided semantic attention (CGSA) module to progressively improve the quality of the depth channel during training. The continuously refined depth features are fused with the normal features at multiple scales by mutual feature fusion (MFF) modules to fully exploit the correlations between normals and depth, resulting in high-quality normals and depth with fine details. Extensive experiments on multiple benchmark datasets demonstrate the superiority of the proposed method.
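To make the 2-sphere idea concrete, the following is a minimal sketch of representing a normal by two angles instead of three Cartesian components. It assumes the standard spherical-coordinate convention (polar angle `theta` measured from the z-axis, azimuth `phi` in the xy-plane); the abstract does not state which parameterization the paper uses, and the function names here are hypothetical. Any pair of angles maps to a unit vector, so the output is normalized by construction.

```python
import numpy as np


def angles_to_normal(theta, phi):
    """Map spherical angles to unit normals.

    Assumed convention (not taken from the paper): theta is the polar
    angle from the +z axis, phi is the azimuth in the xy-plane.
    """
    theta = np.asarray(theta, dtype=float)
    phi = np.asarray(phi, dtype=float)
    sin_t = np.sin(theta)
    # The result has unit length for every (theta, phi), with no explicit
    # normalization step.
    return np.stack(
        [sin_t * np.cos(phi), sin_t * np.sin(phi), np.cos(theta)], axis=-1
    )


def normal_to_angles(n):
    """Inverse map: Cartesian normals (last axis of size 3) to (theta, phi)."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n, axis=-1, keepdims=True)
    theta = np.arccos(np.clip(n[..., 2], -1.0, 1.0))
    phi = np.arctan2(n[..., 1], n[..., 0])
    return theta, phi


def angular_error_deg(n_pred, n_true):
    """Angle in degrees between unit normals, a common evaluation metric."""
    cos = np.clip(np.sum(n_pred * n_true, axis=-1), -1.0, 1.0)
    return np.degrees(np.arccos(cos))
```

A network with a Euclidean output head typically needs an explicit normalization of its three-channel prediction, whereas a two-angle output like the one sketched here lands on the sphere directly.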
Cite
Text
Li et al. "Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58586-0_43
Markdown
[Li et al. "Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/li2020eccv-deep-a/) doi:10.1007/978-3-030-58586-0_43
BibTeX
@inproceedings{li2020eccv-deep-a,
title = {{Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention}},
author = {Li, Quewei and Guo, Jie and Fei, Yang and Tang, Qinyu and Sun, Wenxiu and Zeng, Jin and Guo, Yanwen},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58586-0_43},
url = {https://mlanthology.org/eccv/2020/li2020eccv-deep-a/}
}