A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions

Abstract

Extracting physically plausible 3D human motion from videos is a critical task. Although existing simulation-based motion imitation methods can enhance the physical quality of daily motions estimated from monocular video capture, extending this capability to high-difficulty motions remains an open challenge. This can be attributed to some flawed motion clips in video-based motion capture results and the inherent complexity in modeling high-difficulty motions. Therefore, sensing the advantage of segmentation in localizing human body, we introduce a mask-based motion correction module (MCM) that leverages motion context and video mask to repair flawed motions; and propose a physics-based motion transfer module (PTM), which employs a prior injected pretrain and adapt approach for motion imitation, improving physical plausibility with the ability to handle in-the-wild and challenging motions. Our approach is designed as a plug-and-play module to physically refine the video motion capture, which also excels in motion generation tasks. Finally, we collected a challenging in-the-wild test set to establish a benchmark, and our method has demonstrated effectiveness on both the new benchmark and existing public datasets. Our project page is : https://physicalmotionrestoration.github.io/

Cite

Text

Zhang et al. "A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions." International Conference on Computer Vision, 2025.

Markdown

[Zhang et al. "A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/zhang2025iccv-plugandplay/)

BibTeX

@inproceedings{zhang2025iccv-plugandplay,
  title     = {{A Plug-and-Play Physical Motion Restoration Approach for In-the-Wild High-Difficulty Motions}},
  author    = {Zhang, Youliang and Li, Ronghui and Zhang, Yachao and Pan, Liang and Wang, Jingbo and Liu, Yebin and Li, Xiu},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {13281-13292},
  url       = {https://mlanthology.org/iccv/2025/zhang2025iccv-plugandplay/}
}