Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking

Abstract

In the research field of analysis of people, generating precise full-body human motion from sparse tracking is a significant challenge. It is well known that diffusion techniques excel in generating high-quality two-dimensional (2D) visual content. However, when applied to human motion reconstruction, they might struggle to capture the inherent complexities of human motion, which is characterized by three-dimensional (3D) anatomical features and one-dimensional (1D) temporal dynamics. This heterogeneous structure between human motion and images can lead to accumulated errors at the joints, affecting the accuracy and smoothness of the generated motions. Building on this insight, we propose Human Anatomy Diffusion (HAD), a novel framework that integrates human anatomical features into the denoising process and excels in handling complex motions, accurately capturing body angles and balance, and showing enhanced alignment in motion prediction. HAD remarkably advanced the performance of motion reconstruction, notably enhancing smoothness by 81.29% compared to the previous state-of-the-art works and improving key accuracy metrics like MPJPE, Root PE, and Lower PE by approximately 20% on AMASS. Our method provides a crucial advancement for creating realistic and responsive virtual avatars in real-world applications. The project page is at: https://niuzehai.github.io/had/ .

Cite

Text

Niu et al. "Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91575-8_19

Markdown

[Niu et al. "Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/niu2024eccvw-motion/) doi:10.1007/978-3-031-91575-8_19

BibTeX

@inproceedings{niu2024eccvw-motion,
  title     = {{Motion Reconstruction via Human Anatomy Diffusion from Sparse Tracking}},
  author    = {Niu, Zehai and Lu, Ke and Dong, Kun and Xue, Jian and Qin, Xiaoyu and Wang, Jinbao},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {307-323},
  doi       = {10.1007/978-3-031-91575-8_19},
  url       = {https://mlanthology.org/eccvw/2024/niu2024eccvw-motion/}
}