Zero-Shot Inexact CAD Model Alignment from a Single Image

Abstract

One practical approach to infer 3D scene structure from a single image is to retrieve a closely matching 3D model from a database and align it with the object in the image. Existing methods rely on supervised training with images and pose annotations, which limits them to a narrow set of object categories. To address this, we propose a weakly supervised 9-DoF alignment method for inexact 3D models that requires no scene-level pose annotations and generalizes to unseen categories. Our approach derives a novel feature space based on foundation features that ensure multi-view consistency and overcome symmetry ambiguities inherent in foundation features using a self-supervised triplet loss. Additionally, we introduce a texture-invariant pose refinement technique that performs dense alignment in normalized object coordinates, estimated through the enhanced feature space. We conduct extensive evaluations on the real-world ScanNet25k dataset, where our method outperforms SOTA weakly supervised baselines by +4.3% mean alignment accuracy and is the only weakly supervised approach to surpass the supervised ROCA by +2.7%. To assess generalization, we introduce SUN2CAD, a real-world test set with 20 novel object categories, where our method achieves SOTA results without prior training on them.

Cite

Text

Arsomngern et al. "Zero-Shot Inexact CAD Model Alignment from a Single Image." International Conference on Computer Vision, 2025.

Markdown

[Arsomngern et al. "Zero-Shot Inexact CAD Model Alignment from a Single Image." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/arsomngern2025iccv-zeroshot/)

BibTeX

@inproceedings{arsomngern2025iccv-zeroshot,
  title     = {{Zero-Shot Inexact CAD Model Alignment from a Single Image}},
  author    = {Arsomngern, Pattaramanee and Khwanmuang, Sasikarn and Nießner, Matthias and Suwajanakorn, Supasorn},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {6231-6241},
  url       = {https://mlanthology.org/iccv/2025/arsomngern2025iccv-zeroshot/}
}