Category-Level 6d Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and a New Dataset

Abstract

6D object pose estimation is one of the fundamental problems in computer vision and robotics research. While a lot of recent efforts have been made on generalizing pose estimation to novel object instances within the same category, namely category-level 6D pose estimation, it is still restricted in constrained environments given the limited number of annotated data. In this paper, we collect Wild6D, a new unlabeled RGBD object video dataset with diverse instances and backgrounds. We utilize this data to generalize category-level 6D object pose estimation in the wild with semi-supervised learning. We propose a new model, called \textbf{Re}ndering for \textbf{Po}se estimation network (\textbf{RePoNet}), that is jointly trained using the free ground-truths with the synthetic data, and a silhouette matching objective function on the real-world data. Without using any 3D annotations on real data, our method outperforms state-of-the-art methods on the previous dataset and our Wild6D test set (with manual annotations for evaluation) by a large margin. Our code and dataset will be made publicly available.

Cite

Text

Fu and Wang. "Category-Level 6d Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and a New Dataset." NeurIPS 2022 Workshops: NeurReps, 2022.

Markdown

[Fu and Wang. "Category-Level 6d Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and a New Dataset." NeurIPS 2022 Workshops: NeurReps, 2022.](https://mlanthology.org/neuripsw/2022/fu2022neuripsw-categorylevel/)

BibTeX

@inproceedings{fu2022neuripsw-categorylevel,
  title     = {{Category-Level 6d Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and a New Dataset}},
  author    = {Fu, Yang and Wang, Xiaolong},
  booktitle = {NeurIPS 2022 Workshops: NeurReps},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/fu2022neuripsw-categorylevel/}
}