Sim-to-Real 6d Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking

Abstract

6D object pose estimation is important for robotic bin-picking, and serves as a prerequisite for many downstream industrial applications. However, it is burdensome to annotate a customized dataset associated with each specific bin-picking scenario for training pose estimation models. In this paper, we propose an iterative self-training framework for sim-to-real 6D object pose estimation to facilitate cost-effective robotic grasping. Given a bin-picking scenario, we establish a photo-realistic simulator to synthesize abundant virtual data, and use this to train an initial pose estimation network. This network then takes the role of a teacher model, which generates pose predictions for unlabeled real data. With these predictions, we further design a comprehensive adaptive selection scheme to distinguish reliable results, and leverage them as pseudo labels to update a student model for pose estimation on real data. To continuously improve the quality of pseudo labels, we iterate the above steps by taking the trained student model as a new teacher and re-label real data using the refined teacher model. We evaluate our method on a public benchmark and our newly-released dataset, achieving an ADD(-S) improvement of 11.49% and 22.62% respectively. Our method is also able to improve robotic bin-picking success by 19.54%, demonstrating the potential of iterative sim-to-real solutions for robotic applications. Project homepage: www.cse.cuhk.edu.hk/ kaichen/sim2real_pose.html.

Cite

Text

Chen et al. "Sim-to-Real 6d Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19842-7_31

Markdown

[Chen et al. "Sim-to-Real 6d Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/chen2022eccv-simtoreal/) doi:10.1007/978-3-031-19842-7_31

BibTeX

@inproceedings{chen2022eccv-simtoreal,
  title     = {{Sim-to-Real 6d Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking}},
  author    = {Chen, Kai and Cao, Rui and James, Stephen and Li, Yichuan and Liu, Yun-Hui and Abbeel, Pieter and Dou, Qi},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19842-7_31},
  url       = {https://mlanthology.org/eccv/2022/chen2022eccv-simtoreal/}
}