AnyPortal: Zero-Shot Consistent Video Background Replacement
Abstract
Despite the rapid advancements in video generation technology, creating high-quality videos that precisely align with user intentions remains a significant challenge. Existing methods often fail to achieve fine-grained control over video details, limiting their practical applicability. We introduce AnyPortal, a novel zero-shot framework for video background replacement that leverages pre-trained diffusion models. Our framework collaboratively integrates the temporal prior of video diffusion models with the relighting capabilities of image diffusion models in a zero-shot setting. To address the critical challenge of foreground consistency, we propose a Refinement Projection Algorithm, which enables pixel-level detail manipulation to ensure precise foreground preservation. AnyPortal is training-free and overcomes the challenges of achieving foreground consistency and temporally coherent relighting. Experimental results demonstrate that AnyPortal achieves high-quality results on consumer-grade GPUs, offering a practical and efficient solution for video content creation and editing.
Cite
Text
Gao et al. "AnyPortal: Zero-Shot Consistent Video Background Replacement." International Conference on Computer Vision, 2025.Markdown
[Gao et al. "AnyPortal: Zero-Shot Consistent Video Background Replacement." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/gao2025iccv-anyportal/)BibTeX
@inproceedings{gao2025iccv-anyportal,
title = {{AnyPortal: Zero-Shot Consistent Video Background Replacement}},
author = {Gao, Wenshuo and Lan, Xicheng and Yang, Shuai},
booktitle = {International Conference on Computer Vision},
year = {2025},
pages = {18990-18999},
url = {https://mlanthology.org/iccv/2025/gao2025iccv-anyportal/}
}