Social Diffusion: Long-Term Multiple Human Motion Anticipation
Abstract
We propose Social Diffusion, a novel method for short-term and long-term forecasting of the motion of multiple persons as well as their social interactions. Jointly forecasting motions for multiple persons involved in social activities is inherently a challenging problem due to the interdependencies between individuals. In this work, we leverage a diffusion model conditioned on motion histories and causal temporal convolutional networks to forecast individually and contextually plausible motions for all participants. The contextual plausibility is achieved via an order-invariant aggregation function. As a second contribution, we design a new evaluation protocol that measures the plausibility of social interactions which we evaluate on the Haggling dataset, which features a challenging social activity where people are actively taking turns to talk and switching their attention. We evaluate our approach on four datasets for multi-person forecasting where our approach outperforms the state-of-the-art in terms of motion realism and contextual plausibility.
Cite
Text
Tanke et al. "Social Diffusion: Long-Term Multiple Human Motion Anticipation." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00880Markdown
[Tanke et al. "Social Diffusion: Long-Term Multiple Human Motion Anticipation." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/tanke2023iccv-social/) doi:10.1109/ICCV51070.2023.00880BibTeX
@inproceedings{tanke2023iccv-social,
title = {{Social Diffusion: Long-Term Multiple Human Motion Anticipation}},
author = {Tanke, Julian and Zhang, Linguang and Zhao, Amy and Tang, Chengcheng and Cai, Yujun and Wang, Lezi and Wu, Po-Chen and Gall, Juergen and Keskin, Cem},
booktitle = {International Conference on Computer Vision},
year = {2023},
pages = {9601-9611},
doi = {10.1109/ICCV51070.2023.00880},
url = {https://mlanthology.org/iccv/2023/tanke2023iccv-social/}
}