Tackling View-Dependent Semantics in 3D Language Gaussian Splatting

Abstract

Recent advancements in 3D Gaussian Splatting (3D-GS) enable high-quality 3D scene reconstruction from RGB images. Many studies extend this paradigm for language-driven open-vocabulary scene understanding. However, most of them simply project 2D semantic features onto 3D Gaussians and overlook a fundamental gap between 2D and 3D understanding: a 3D object may exhibit various semantics from different viewpoints—a phenomenon we term view-dependent semantics. To address this challenge, we propose LaGa (Language Gaussians), which establishes cross-view semantic connections by decomposing the 3D scene into objects. Then, it constructs view-aggregated semantic representations by clustering semantic descriptors and reweighting them based on multi-view semantics. Extensive experiments demonstrate that LaGa effectively captures key information from view-dependent semantics, enabling a more comprehensive understanding of 3D scenes. Notably, under the same settings, LaGa achieves a significant improvement of +18.7% mIoU over the previous SOTA on the LERF-OVS dataset. Our code is available at: https://github.com/https://github.com/SJTU-DeepVisionLab/LaGa.

Cite

Text

Cen et al. "Tackling View-Dependent Semantics in 3D Language Gaussian Splatting." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Cen et al. "Tackling View-Dependent Semantics in 3D Language Gaussian Splatting." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/cen2025icml-tackling/)

BibTeX

@inproceedings{cen2025icml-tackling,
  title     = {{Tackling View-Dependent Semantics in 3D Language Gaussian Splatting}},
  author    = {Cen, Jiazhong and Zhou, Xudong and Fang, Jiemin and Wen, Changsong and Xie, Lingxi and Zhang, Xiaopeng and Shen, Wei and Tian, Qi},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {7013-7034},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/cen2025icml-tackling/}
}