PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6d Pose Estimation

Abstract

RGB-D based 6D pose estimation has recently achieved remarkable progress, but still suffers from two major limitations: (1) ineffective representation of depth data and (2) insufficient integration of different modalities. This paper proposes a novel deep learning approach, namely Graph Convolutional Network with Point Refinement (PR-GCN), to simultaneously address the issues above in a unified way. It first introduces the Point Refinement Network (PRN) to polish 3D point clouds, recovering missing parts with noise removed. Subsequently, the Multi-Modal Fusion Graph Convolutional Network (MMF-GCN) is presented to strengthen RGB-D combination, which captures geometry-aware inter-modality correlation through local information propagation in the graph convolutional network. Extensive experiments are conducted on three widely used benchmarks, and state-of-the-art performance is reached. Besides, it is also shown that the proposed PRN and MMF-GCN modules are well generalized to other frameworks.

Cite

Text

Zhou et al. "PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6d Pose Estimation." International Conference on Computer Vision, 2021. doi:10.1109/ICCV48922.2021.00279

Markdown

[Zhou et al. "PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6d Pose Estimation." International Conference on Computer Vision, 2021.](https://mlanthology.org/iccv/2021/zhou2021iccv-prgcn/) doi:10.1109/ICCV48922.2021.00279

BibTeX

@inproceedings{zhou2021iccv-prgcn,
  title     = {{PR-GCN: A Deep Graph Convolutional Network with Point Refinement for 6d Pose Estimation}},
  author    = {Zhou, Guangyuan and Wang, Huiqun and Chen, Jiaxin and Huang, Di},
  booktitle = {International Conference on Computer Vision},
  year      = {2021},
  pages     = {2793-2802},
  doi       = {10.1109/ICCV48922.2021.00279},
  url       = {https://mlanthology.org/iccv/2021/zhou2021iccv-prgcn/}
}