Learning Canonical Shape Space for Category-Level 6d Object Pose and Size Estimation

Chen, Dengsheng; Li, Jun; Wang, Zheng; Xu, Kai

doi:10.1109/CVPR42600.2020.01199

Learning Canonical Shape Space for Category-Level 6d Object Pose and Size Estimation

Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu

CVPR 2020

doi:10.1109/CVPR42600.2020.01199 /cvpr/2020/chen2020cvpr-learning-a/

Abstract

We present a novel approach to category-level 6D object pose and size estimation. To tackle intra-class shape variations, we learn canonical shape space (CASS), a unified representation for a large variety of instances of a certain object category. In particular, CASS is modeled as the latent space of a deep generative model of canonical 3D shapes with normalized pose. We train a variational auto-encoder (VAE) for generating 3D point clouds in the canonical space from an RGBD image. The VAE is trained in a cross-category fashion, exploiting the publicly available large 3D shape repositories. Since the 3D point cloud is generated in normalized pose (with actual size), the encoder of the VAE learns view-factorized RGBD embedding. It maps an RGBD image in arbitrary view into a poseindependent 3D shape representation. Object pose is then estimated via contrasting it with a pose-dependent feature of the input RGBD extracted with a separate deep neural networks. We integrate the learning of CASS and pose and size estimation into an end-to-end trainable network, achieving the state-of-the-art performance.

PDF CVPR Semantic Scholar

Cite

Text

Chen et al. "Learning Canonical Shape Space for Category-Level 6d Object Pose and Size Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.01199

Markdown

[Chen et al. "Learning Canonical Shape Space for Category-Level 6d Object Pose and Size Estimation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/chen2020cvpr-learning-a/) doi:10.1109/CVPR42600.2020.01199

BibTeX

@inproceedings{chen2020cvpr-learning-a,
  title     = {{Learning Canonical Shape Space for Category-Level 6d Object Pose and Size Estimation}},
  author    = {Chen, Dengsheng and Li, Jun and Wang, Zheng and Xu, Kai},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.01199},
  url       = {https://mlanthology.org/cvpr/2020/chen2020cvpr-learning-a/}
}