DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image

Abstract

Explicit 3D shape abstraction from a single 2D image is a long-standing problem in computer vision and graphics. By leveraging a set of primitives to represent the target shape, recent methods have achieved promising results. However, these methods either use a relatively larger number of primitives or lack geometric flexibility due to the low-dimensional expressibility of the primitives. In this paper, we propose a novel bi-channel Transformer architecture, integrated with parameterized deformable models, termed DeFormer, to simultaneously estimate global and local deformations. In this way, DeFormer can abstract complex object shapes while using a small number of primitives which offer a broader geometry coverage and finer details. Then, we introduce a force-driven dynamic fitting and a cycle-consistent re-projection loss to optimize the primitive parameters. Extensive experiments on ShapeNet across various settings show that DeFormer achieves better reconstruction accuracy over the state-of-the-art, and visualizes with consistent semantic correspondences for improved interpretability.

Cite

Text

Liu et al. "DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.01309

Markdown

[Liu et al. "DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/liu2023iccv-deformer/) doi:10.1109/ICCV51070.2023.01309

BibTeX

@inproceedings{liu2023iccv-deformer,
  title     = {{DeFormer: Integrating Transformers with Deformable Models for 3D Shape Abstraction from a Single Image}},
  author    = {Liu, Di and Yu, Xiang and Ye, Meng and Zhangli, Qilong and Li, Zhuowei and Zhang, Zhixing and Metaxas, Dimitris N.},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {14236-14246},
  doi       = {10.1109/ICCV51070.2023.01309},
  url       = {https://mlanthology.org/iccv/2023/liu2023iccv-deformer/}
}