AnyPortal: Zero-Shot Consistent Video Background Replacement

Abstract

Despite the rapid advancements in video generation technology, creating high-quality videos that precisely align with user intentions remains a significant challenge. Existing methods often fail to achieve fine-grained control over video details, limiting their practical applicability. We introduce AnyPortal, a novel zero-shot framework for video background replacement that leverages pre-trained diffusion models. Our framework collaboratively integrates the temporal prior of video diffusion models with the relighting capabilities of image diffusion models in a zero-shot setting. To address the critical challenge of foreground consistency, we propose a Refinement Projection Algorithm, which enables pixel-level detail manipulation to ensure precise foreground preservation. AnyPortal is training-free and overcomes the challenges of achieving foreground consistency and temporally coherent relighting. Experimental results demonstrate that AnyPortal achieves high-quality results on consumer-grade GPUs, offering a practical and efficient solution for video content creation and editing.

Cite

Text

Gao et al. "AnyPortal: Zero-Shot Consistent Video Background Replacement." International Conference on Computer Vision, 2025.

Markdown

[Gao et al. "AnyPortal: Zero-Shot Consistent Video Background Replacement." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/gao2025iccv-anyportal/)

BibTeX

@inproceedings{gao2025iccv-anyportal,
  title     = {{AnyPortal: Zero-Shot Consistent Video Background Replacement}},
  author    = {Gao, Wenshuo and Lan, Xicheng and Yang, Shuai},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {18990-18999},
  url       = {https://mlanthology.org/iccv/2025/gao2025iccv-anyportal/}
}