LCGen: Mining in Low-Certainty Generation for View-Consistent Text-to-3D
Abstract
The Janus Problem is a common issue in SDS-based text-to-3D methods. Due to view encoding approach and 2D diffusion prior guidance, the 3D representation model tends to learn content with higher certainty from each perspective, leading to view inconsistency. In this work, we first model and analyze the problem, visualizing the specific causes of the Janus Problem, which are associated with discrete view encoding and shared priors in 2D lifting. Based on this, we further propose the LCGen method, which guides text-to-3D to obtain different priors with different certainty from various viewpoints, aiding in view-consistent generation. Experiments have proven that our LCGen method can be directly applied to different SDS-based text-to-3D methods, alleviating the Janus Problem without introducing additional information, increasing excessive training burden, or compromising the generation effect.
Cite
Text
Tao et al. "LCGen: Mining in Low-Certainty Generation for View-Consistent Text-to-3D." Neural Information Processing Systems, 2024. doi:10.52202/079017-0641Markdown
[Tao et al. "LCGen: Mining in Low-Certainty Generation for View-Consistent Text-to-3D." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/tao2024neurips-lcgen/) doi:10.52202/079017-0641BibTeX
@inproceedings{tao2024neurips-lcgen,
title = {{LCGen: Mining in Low-Certainty Generation for View-Consistent Text-to-3D}},
author = {Tao, Zeng and Yang, Tong and Lin, Junxiong and Mai, Xinji and Wang, Haoran and Wang, Beining and Zhou, Enyu and Wang, Yan and Zhang, Wenqiang},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-0641},
url = {https://mlanthology.org/neurips/2024/tao2024neurips-lcgen/}
}