Free4D: Tuning-Free 4D Scene Generation with Spatial-Temporal Consistency

Liu, Tianqi; Huang, Zihao; Chen, Zhaoxi; Wang, Guangcong; Hu, Shoukang; Shen, Liao; Sun, Huiqiang; Cao, Zhiguo; Li, Wei; Liu, Ziwei

ICCV 2025 pp. 25571-25582

Abstract

We present Free4D, a novel tuning-free framework for 4D scene generation from a single image. Existing methods either focus on object-level generation, making scene-level generation infeasible, or rely on large-scale multi-view video datasets for expensive training, with limited generalization ability due to the scarcity of 4D scene data. In contrast, our key insight is to distill pre-trained foundation models for a consistent 4D scene representation, which offers promising advantages such as efficiency and generalizability. 1) To achieve this, we first animate the input image using image-to-video diffusion models, followed by 4D geometric structure initialization. 2) To turn this coarse structure into spatially and temporally consistent multi-view videos, we design an adaptive guidance mechanism with a point-guided denoising strategy for spatial consistency and a novel latent replacement strategy for temporal coherence. 3) To lift these generated observations into a consistent 4D representation, we propose a modulation-based refinement to mitigate inconsistencies while fully leveraging the generated information. The resulting 4D representation enables real-time, controllable rendering, marking a significant advancement in single-image-based 4D scene generation.

Cite

Text

Liu et al. "Free4D: Tuning-Free 4D Scene Generation with Spatial-Temporal Consistency." International Conference on Computer Vision, 2025.

Markdown

[Liu et al. "Free4D: Tuning-Free 4D Scene Generation with Spatial-Temporal Consistency." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/liu2025iccv-free4d/)

BibTeX

@inproceedings{liu2025iccv-free4d,
  title     = {{Free4D: Tuning-Free 4D Scene Generation with Spatial-Temporal Consistency}},
  author    = {Liu, Tianqi and Huang, Zihao and Chen, Zhaoxi and Wang, Guangcong and Hu, Shoukang and Shen, Liao and Sun, Huiqiang and Cao, Zhiguo and Li, Wei and Liu, Ziwei},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {25571-25582},
  url       = {https://mlanthology.org/iccv/2025/liu2025iccv-free4d/}
}