SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting

Abstract

We propose SelfSplat, a novel 3D Gaussian Splatting model designed to perform pose-free and 3D prior-free generalizable 3D reconstruction from unposed multi-view images. These settings are inherently ill-posed due to the lack of ground-truth data, learned geometric information, and the need to achieve accurate 3D reconstruction without finetuning, making it difficult for conventional methods to achieve high-quality results. Our model addresses these challenges by effectively integrating explicit 3D representations with self-supervised depth and pose estimation techniques, resulting in reciprocal improvements in both pose accuracy and 3D reconstruction quality. Furthermore, we incorporate a matching-aware pose estimation network and a depth refinement module to enhance geometry consistency across views, ensuring more accurate and stable 3D reconstructions. To present the performance of our method, we evaluated it on large-scale real-world datasets, including RealEstate10K, ACID, and DL3DV. SelfSplat achieves superior results over previous state-of-the-art methods in both appearance and geometry quality, also demonstrates strong cross-dataset generalization capabilities. Extensive ablation studies and analysis also validate the effectiveness of our proposed methods.

Cite

Text

Kang et al. "SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.02050

Markdown

[Kang et al. "SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/kang2025cvpr-selfsplat/) doi:10.1109/CVPR52734.2025.02050

BibTeX

@inproceedings{kang2025cvpr-selfsplat,
  title     = {{SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting}},
  author    = {Kang, Gyeongjin and Yoo, Jisang and Park, Jihyeon and Nam, Seungtae and Im, Hyeonsoo and Shin, Sangheon and Kim, Sangpil and Park, Eunbyung},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {22012-22022},
  doi       = {10.1109/CVPR52734.2025.02050},
  url       = {https://mlanthology.org/cvpr/2025/kang2025cvpr-selfsplat/}
}