Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework
Abstract
We propose 3D Super Resolution (3DSR), a novel 3D Gaussian-splatting-based super-resolution framework that leverages off-the-shelf diffusion-based 2D super-resolution models. 3DSR encourages 3D consistency across views via the use of an explicit 3D Gaussian-splatting-based scene representation. This makes the proposed 3DSR different from prior work, such as image upsampling or the use of video super-resolution, which either don't consider 3D consistency or aim to incorporate 3D consistency implicitly. Notably, our method enhances visual quality without additional fine-tuning, ensuring spatial coherence within the reconstructed scene. We evaluate 3DSR on MipNeRF360 and LLFF data, demonstrating that it produces high-resolution results that are visually compelling, while maintaining structural consistency in 3D reconstructions. Code will be released.
Cite
Text
Chen et al. "Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework." International Conference on Computer Vision, 2025.Markdown
[Chen et al. "Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/chen2025iccv-bridging/)BibTeX
@inproceedings{chen2025iccv-bridging,
title = {{Bridging Diffusion Models and 3D Representations: A 3D Consistent Super-Resolution Framework}},
author = {Chen, Yi-Ting and Liao, Ting-Hsuan and Guo, Pengsheng and Schwing, Alexander and Huang, Jia-Bin},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {13481-13490},
url = {https://mlanthology.org/iccv/2025/chen2025iccv-bridging/}
}