HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness
Abstract
We study the problem of precisely swapping objects in videos, with a focus on those interacted with by hands, given one user-provided reference object image. Despite the great advancements that diffusion models have made in video editing recently, these models often fall short in handling the intricacies of hand-object interactions (HOI), failing to produce realistic edits---especially when object swapping results in object shape or functionality changes. To bridge this gap, we present HOI-Swap, a novel diffusion-based video editing framework trained in a self-supervised manner. Designed in two stages, the first stage focuses on object swapping in a single frame with HOI awareness; the model learns to adjust the interaction patterns, such as the hand grasp, based on changes in the object's properties. The second stage extends the single-frame edit across the entire sequence; we achieve controllable motion alignment with the original video by: (1) warping a new sequence from the stage-I edited frame based on sampled motion points and (2) conditioning video generation on the warped sequence. Comprehensive qualitative and quantitative evaluations demonstrate that HOI-Swap significantly outperforms existing methods, delivering high-quality video edits with realistic HOIs.
Cite
Text
Xue et al. "HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness." Neural Information Processing Systems, 2024. doi:10.52202/079017-2454Markdown
[Xue et al. "HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/xue2024neurips-hoiswap/) doi:10.52202/079017-2454BibTeX
@inproceedings{xue2024neurips-hoiswap,
title = {{HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness}},
author = {Xue, Zihui and Luo, Mi and Chen, Changan and Grauman, Kristen},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-2454},
url = {https://mlanthology.org/neurips/2024/xue2024neurips-hoiswap/}
}