Multi-Agent Long-Term 3D Human Pose Forecasting via Interaction-Aware Trajectory Conditioning
Abstract
Human pose forecasting garners attention for its diverse applications. However challenges in modeling the multi-modal nature of human motion and intricate interactions among agents persist particularly with longer timescales and more agents. In this paper we propose an interaction-aware trajectory-conditioned long-term multi-agent human pose forecasting model utilizing a coarse-to-fine prediction approach: multi-modal global trajectories are initially forecasted followed by respective local pose forecasts conditioned on each mode. In doing so our Trajectory2Pose model introduces a graph-based agent-wise interaction module for a reciprocal forecast of local motion-conditioned global trajectory and trajectory-conditioned local pose. Our model effectively handles the multi-modality of human motion and the complexity of long-term multi-agent interactions improving performance in complex environments. Furthermore we address the lack of long-term (6s+) multi-agent (5+) datasets by constructing a new dataset from real-world images and 2D annotations enabling a comprehensive evaluation of our proposed model. State-of-the-art prediction performance on both complex and simpler datasets confirms the generalized effectiveness of our method. The code is available at https://github.com/Jaewoo97/T2P.
Cite
Text
Jeong et al. "Multi-Agent Long-Term 3D Human Pose Forecasting via Interaction-Aware Trajectory Conditioning." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.00160Markdown
[Jeong et al. "Multi-Agent Long-Term 3D Human Pose Forecasting via Interaction-Aware Trajectory Conditioning." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/jeong2024cvpr-multiagent/) doi:10.1109/CVPR52733.2024.00160BibTeX
@inproceedings{jeong2024cvpr-multiagent,
title = {{Multi-Agent Long-Term 3D Human Pose Forecasting via Interaction-Aware Trajectory Conditioning}},
author = {Jeong, Jaewoo and Park, Daehee and Yoon, Kuk-Jin},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {1617-1628},
doi = {10.1109/CVPR52733.2024.00160},
url = {https://mlanthology.org/cvpr/2024/jeong2024cvpr-multiagent/}
}