InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception

Abstract

3D scene understanding is vital for applications in autonomous driving, robotics, and augmented reality. However, scene understanding based on 3D Gaussian Splatting faces three key challenges: (i) an imbalance between appearance and semantics, (ii) inconsistencies in object boundaries, and (iii) difficulties with top-down instance segmentation. To address these challenges, we propose InstanceGaussian, a method that jointly learns appearance and semantic features while adaptively aggregating instances. Our contributions are as follows: (i) a new Semantic-Scaffold-GS representation to improve feature representation and boundary delineation, (ii) a progressive training strategy for enhanced stability and segmentation, and (iii) a category-agnostic, bottom-up instance aggregation approach for better segmentation. Experimental results demonstrate that our approach achieves state-of-the-art performance in category-agnostic, open-vocabulary 3D point-level segmentation, validating the effectiveness of our proposed method.

Cite

Text

Li et al. "InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01314

Markdown

[Li et al. "InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/li2025cvpr-instancegaussian/) doi:10.1109/CVPR52734.2025.01314

BibTeX

@inproceedings{li2025cvpr-instancegaussian,
  title     = {{InstanceGaussian: Appearance-Semantic Joint Gaussian Representation for 3D Instance-Level Perception}},
  author    = {Li, Haijie and Wu, Yanmin and Meng, Jiarui and Gao, Qiankun and Zhang, Zhiyao and Wang, Ronggang and Zhang, Jian},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {14078-14088},
  doi       = {10.1109/CVPR52734.2025.01314},
  url       = {https://mlanthology.org/cvpr/2025/li2025cvpr-instancegaussian/}
}