Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning

Abstract

Human Pose Forecasting is a major problem in human intention comprehension that can be addressed through learning the historical poses via deep methods. However, existing methods often lack the modeling of the person’s role in the event in multi-person scenes. This leads to limited performance in complicated scenes with variant interactions happening at the same time. In this paper, we introduce the Interaction-Aware Pose Forecasting Transformer (IAFormer) framework to better learn the interaction features. With the key insight that the event often involves only part of the people in the scene, we designed the Interaction Perceptron Module (IPM) to evaluate the human-to-event interaction level. With the interaction evaluation, the human-independent features are extracted with the attention mechanism for interaction-aware forecasting. In addition, an Interaction Prior Learning Module (IPLM) is presented to learn and accumulate prior knowledge of high-frequency interactions, encouraging semantic pose forecasting rather than simple trajectory pose forecasting. We conduct experiments using datasets such as CMU-Mocap, UMPM, CHI3D, Human3.6M, and synthesized crowd datasets. The results demonstrate that our method significantly outperforms state-of-the-art approaches considering scenarios with varying numbers of people. Code is available at purplehttps: //github.com/ArcticPole/IAFormer

Cite

Text

Xiao et al. "Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72649-1_23

Markdown

[Xiao et al. "Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/xiao2024eccv-multiperson/) doi:10.1007/978-3-031-72649-1_23

BibTeX

@inproceedings{xiao2024eccv-multiperson,
  title     = {{Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning}},
  author    = {Xiao, Peng and Xie, Yi and Xu, Xuemiao and Chen, Weihong and Zhang, Huaidong},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-72649-1_23},
  url       = {https://mlanthology.org/eccv/2024/xiao2024eccv-multiperson/}
}