MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation

Abstract

Character image animation, which synthesizes videos of reference characters driven by pose sequences, has advanced rapidly but remains largely limited to single-human settings. Existing methods struggle to generalize to multi-humanoid scenarios, which involve diverse humanoid forms, complex interactions, and frequent occlusions. We address this gap with two key innovations. First, we introduce unified motion representations that extract identity-agnostic motions and explicitly bind them to corresponding characters, enabling generalization across diverse humanoid forms and seamless extension to multi-humanoid scenarios. Second, we propose a holistic 4D-anchored paradigm that constructs a shared 4D space to fuse motion representations with video latents, and further reinforces this process with hierarchical 4D-level supervision to better handle interactions and occlusions. We instantiate these ideas in MotionWeaver, an end-to-end framework for multi-humanoid image animation. To support this setting, we curate a 46-hour dataset of multi-human videos with rich interactions, and construct a 300-video benchmark featuring paired humanoid characters. Quantitative and qualitative experiments demonstrate that MotionWeaver not only achieves state-of-the-art results on our benchmark but also generalizes effectively across diverse humanoid forms, complex interactions, and challenging multi-humanoid scenarios.

Cite

Text

Hu et al. "MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation." International Conference on Learning Representations, 2026.

Markdown

[Hu et al. "MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/hu2026iclr-motionweaver/)

BibTeX

@inproceedings{hu2026iclr-motionweaver,
  title     = {{MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation}},
  author    = {Hu, Xirui and Ding, Yanbo and Wang, Jiahao and Shi, Tingting and Wang, Yali and Zhi, Guo Zhi and Zhang, Weizhan},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/hu2026iclr-motionweaver/}
}