Drafting and Revision: Advancing High-Fidelity Video Inpainting

Abstract

Video inpainting aims to fill the missing regions in video with spatial-temporally coherent contents. Existing methods usually treat the missing contents as a whole and adopt a hybrid objective containing a reconstruction loss and an adversarial loss to train the model. However, these two kinds of loss focus on contents at different frequencies, simply combining them may cause inter-frequency conflicts, leading the trained model to generate compromised results. Inspired by the common corrupted painting restoration process of “drawing a draft first and then revising the details later”, this paper proposes a Drafting-and-Revision Completion Network (DRCN) for video inpainting. Specifically, we first design a Drafting Network that utilizes the temporal information to complete the low-frequency semantic structure at low resolution. Then, a Revision Network is developed to hallucinate high-frequency details at high resolution by using the output of Drafting Network. In this way, adversarial loss and reconstruction loss can be applied to high-frequency and low-frequency respectively, effectively mitigating inter-frequency conflicts. Furthermore, Revision Network can be stacked in a pyramid manner to generate higher resolution details, which provide a feasible solution for high-resolution video inpainting. Experiments show that DRCN achieves improvements of 7.43% and 12.64% in E_warp and LPIPS, and can handle higher resolution videos on limited GPU memory.

Cite

Text

Wu et al. "Drafting and Revision: Advancing High-Fidelity Video Inpainting." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/230

Markdown

[Wu et al. "Drafting and Revision: Advancing High-Fidelity Video Inpainting." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/wu2025ijcai-drafting/) doi:10.24963/IJCAI.2025/230

BibTeX

@inproceedings{wu2025ijcai-drafting,
  title     = {{Drafting and Revision: Advancing High-Fidelity Video Inpainting}},
  author    = {Wu, Zhiliang and Li, Kun and Fan, Hehe and Yang, Yi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {2063-2071},
  doi       = {10.24963/IJCAI.2025/230},
  url       = {https://mlanthology.org/ijcai/2025/wu2025ijcai-drafting/}
}