Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild
Abstract
Recent monocular 3D shape reconstruction methods have shown promising zero-shot results on object-segmented images without any occlusions. However, their effectiveness is significantly compromised in real-world conditions, due to imperfect object segmentation by off-the-shelf models and the prevalence of occlusions. To effectively address these issues, we propose a unified regression model that integrates segmentation and reconstruction, specifically designed for occlusion-aware 3D shape reconstruction. To facilitate its reconstruction in the wild, we also introduce a scalable data synthesis pipeline that simulates a wide range of variations in objects, occluders, and backgrounds. Training on our synthetic data enables the proposed model to achieve state-of-the-art zero-shot results on real-world images, using significantly fewer parameters than competing approaches.
Cite
Text
Cho et al. "Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.02122Markdown
[Cho et al. "Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/cho2025cvpr-robust/) doi:10.1109/CVPR52734.2025.02122BibTeX
@inproceedings{cho2025cvpr-robust,
title = {{Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild}},
author = {Cho, Junhyeong and Youwang, Kim and Yang, Hunmin and Oh, Tae-Hyun},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2025},
pages = {22786-22798},
doi = {10.1109/CVPR52734.2025.02122},
url = {https://mlanthology.org/cvpr/2025/cho2025cvpr-robust/}
}