GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions

Abstract

Recently impressive results have been achieved in 3D scene editing with text instructions based on a 2D diffusion model. However current diffusion models primarily generate images by predicting noise in the latent space and the editing is usually applied to the whole image which makes it challenging to perform delicate especially localized editing for 3D scenes. Inspired by recent 3D Gaussian splatting we propose a systematic framework named GaussianEditor to edit 3D scenes delicately via 3D Gaussians with text instructions. Benefiting from the explicit property of 3D Gaussians we design a series of techniques to achieve delicate editing. Specifically we first extract the region of interest (RoI) corresponding to the text instruction aligning it to 3D Gaussians. The Gaussian RoI is further used to control the editing process. Our framework can achieve more delicate and precise editing of 3D scenes than previous methods while enjoying much faster training speed i.e. within 20 minutes on a single V100 GPU more than twice as fast as Instruct-NeRF2NeRF (45 minutes -- 2 hours). The project page is at GaussianEditor.github.io.

Cite

Text

Wang et al. "GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01975

Markdown

[Wang et al. "GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/wang2024cvpr-gaussianeditor/) doi:10.1109/CVPR52733.2024.01975

BibTeX

@inproceedings{wang2024cvpr-gaussianeditor,
  title     = {{GaussianEditor: Editing 3D Gaussians Delicately with Text Instructions}},
  author    = {Wang, Junjie and Fang, Jiemin and Zhang, Xiaopeng and Xie, Lingxi and Tian, Qi},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {20902-20911},
  doi       = {10.1109/CVPR52733.2024.01975},
  url       = {https://mlanthology.org/cvpr/2024/wang2024cvpr-gaussianeditor/}
}