Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons

Abstract

We introduce an unsupervised technique for encoding point clouds into a canonical shape representation, by disentangling shape and pose. Our encoder is stable and consistent, meaning that the shape encoding is purely pose-invariant, while the extracted rotation and translation are able to semantically align different input shapes of the same class to a common canonical pose. Specifically, we design an auto-encoder based on Vector Neuron Networks, a rotation-equivariant neural network, whose layers we extend to provide translation-equivariance in addition to rotation-equivariance only. The resulting encoder produces pose-invariant shape encoding by construction, enabling our approach to focus on learning a consistent canonical pose for a class of objects. Quantitative and qualitative experiments validate the superior stability and consistency of our approach.

Cite

Text

Katzir et al. "Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-20062-5_27

Markdown

[Katzir et al. "Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/katzir2022eccv-shapepose/) doi:10.1007/978-3-031-20062-5_27

BibTeX

@inproceedings{katzir2022eccv-shapepose,
  title     = {{Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons}},
  author    = {Katzir, Oren and Lischinski, Dani and Cohen-Or, Daniel},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-20062-5_27},
  url       = {https://mlanthology.org/eccv/2022/katzir2022eccv-shapepose/}
}