Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models
Abstract
Traditional 3D content creation tools empower users to bring their imagination to life by giving them direct control over a scene's geometry appearance motion and camera path. Creating computer-generated videos however is a tedious manual process which can be automated by emerging text-to-video diffusion models. Despite great promise video diffusion models are difficult to control hindering users to apply their creativity rather than amplifying it. To address this challenge we present a novel approach that combines the controllability of dynamic 3D meshes with the expressivity and editability of emerging diffusion models. For this purpose our approach takes an animated low-fidelity rendered mesh as input and injects the ground truth correspondence information obtained from the dynamic mesh into various stages of a pre-trained text-to-image generation model to output high-quality and temporally consistent frames. We demonstrate our approach on various examples where motion can be obtained by animating rigged assets or changing the camera path.
Cite
Text
Cai et al. "Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00727Markdown
[Cai et al. "Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/cai2024cvpr-generative/) doi:10.1109/CVPR52733.2024.00727BibTeX
@inproceedings{cai2024cvpr-generative,
title = {{Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models}},
author = {Cai, Shengqu and Ceylan, Duygu and Gadelha, Matheus and Huang, Chun-Hao Paul and Wang, Tuanfeng Yang and Wetzstein, Gordon},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {7611-7620},
doi = {10.1109/CVPR52733.2024.00727},
url = {https://mlanthology.org/cvpr/2024/cai2024cvpr-generative/}
}