CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image
Abstract
Successful depth completion from a single RGB-D image requires both extracting plentiful 2D and 3D features and merging these heterogeneous features appropriately. We propose a novel depth completion framework, CostDCNet, based on the cost volume-based depth estimation approach that has been successfully employed for multi-view stereo (MVS). The key to high-quality depth map estimation in the approach is constructing an accurate cost volume. To produce a quality cost volume tailored to single-view depth completion, we present a simple but effective architecture that can fully exploit the 3D information, three options to make an RGB-D feature volume, and per-plane pixel shuffle for efficient volume upsampling. Our CostDCNet framework consists of lightweight deep neural networks ( 1.8M parameters), running in real time ( 30ms). Nevertheless, thanks to our simple but effective design, CostDCNet demonstrates depth completion results comparable to or better than the state-of-the-art methods.
Cite
Text
Kam et al. "CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20086-1_15Markdown
[Kam et al. "CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/kam2022eccv-costdcnet/) doi:10.1007/978-3-031-20086-1_15BibTeX
@inproceedings{kam2022eccv-costdcnet,
title = {{CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image}},
author = {Kam, Jaewon and Kim, Jungeon and Kim, Soongjin and Park, Jaesik and Lee, Seungyong},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2022},
doi = {10.1007/978-3-031-20086-1_15},
url = {https://mlanthology.org/eccv/2022/kam2022eccv-costdcnet/}
}