ReCamMaster: Camera-Controlled Generative Rendering from a Single Video
Abstract
Camera control has been actively studied in text or image conditioned video generation tasks. However, altering camera trajectories of a given video remains under-explored, despite its importance in the field of video creation. It is non-trivial due to the extra constraints of maintaining multiple-frame appearance and dynamic synchronization. To address this, we present ReCamMaster, a camera-controlled generative video re-rendering framework that reproduces the dynamic scene of an input video at novel camera trajectories. The core innovation lies in harnessing the generative capabilities of pre-trained text-to-video models through an elegant yet powerful video conditioning mechanism--an aspect often overlooked in current research. To overcome the scarcity of qualified training data, we construct a comprehensive multi-camera synchronized video dataset using Unreal Engine 5, which is carefully curated to follow real-world filming characteristics, covering diverse scenes and camera movements. It helps the model generalize to in-the-wild videos. Lastly, we further improve the robustness to diverse inputs through a meticulously designed training strategy. Extensive experiments show that our method substantially outperforms existing state-of-the-art approaches and strong baselines. Our method also finds promising applications in video stabilization, super-resolution, and outpainting. Our code and dataset are publicly available.
Cite
Text
Bai et al. "ReCamMaster: Camera-Controlled Generative Rendering from a Single Video." International Conference on Computer Vision, 2025.Markdown
[Bai et al. "ReCamMaster: Camera-Controlled Generative Rendering from a Single Video." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/bai2025iccv-recammaster/)BibTeX
@inproceedings{bai2025iccv-recammaster,
title = {{ReCamMaster: Camera-Controlled Generative Rendering from a Single Video}},
author = {Bai, Jianhong and Xia, Menghan and Fu, Xiao and Wang, Xintao and Mu, Lianrui and Cao, Jinwen and Liu, Zuozhu and Hu, Haoji and Bai, Xiang and Wan, Pengfei and Zhang, Di},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {14834-14844},
url = {https://mlanthology.org/iccv/2025/bai2025iccv-recammaster/}
}