Deep Closest Point: Learning Representations for Point Cloud Registration

Abstract

Point cloud registration is a key problem for computer vision applied to robotics, medical imaging, and other applications. This problem involves finding a rigid transformation from one point cloud into another so that they align. Iterative Closest Point (ICP) and its variants provide simple and easily-implemented iterative methods for this task, but these algorithms can converge to spurious local optima. To address local optima and other difficulties in the ICP pipeline, we propose a learning-based method, titled Deep Closest Point (DCP), inspired by recent techniques in computer vision and natural language processing. Our model consists of three parts: a point cloud embedding network, an attention-based module combined with a pointer generation layer to approximate combinatorial matching, and a differentiable singular value decomposition (SVD) layer to extract the final rigid transformation. We train our model end-to-end on the ModelNet40 dataset and show in several settings that it performs better than ICP, its variants (e.g., Go-ICP, FGR), and the recently-proposed learning-based method PointNetLK. Beyond providing a state-of-the-art registration technique, we evaluate the suitability of our learned features transferred to unseen objects. We also provide preliminary analysis of our learned model to help understand whether domain-specific and/or global features facilitate rigid registration.

Cite

Text

Wang and Solomon. "Deep Closest Point: Learning Representations for Point Cloud Registration." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019. doi:10.1109/ICCV.2019.00362

Markdown

[Wang and Solomon. "Deep Closest Point: Learning Representations for Point Cloud Registration." Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.](https://mlanthology.org/iccv/2019/wang2019iccv-deep/) doi:10.1109/ICCV.2019.00362

BibTeX

@inproceedings{wang2019iccv-deep,
  title     = {{Deep Closest Point: Learning Representations for Point Cloud Registration}},
  author    = {Wang, Yue and Solomon, Justin M.},
  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year      = {2019},
  doi       = {10.1109/ICCV.2019.00362},
  url       = {https://mlanthology.org/iccv/2019/wang2019iccv-deep/}
}