THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Mtion Prediction
Abstract
Autonomous systems, that need to operate in human environments and interact with the users, rely on understanding and anticipating human activity and motion. Among the many factors which influence human motion, semantic attributes, such as the roles and ongoing activities of the detected people, provide a powerful cue on their future motion, actions, and intentions. In this work we adapt several popular deep learning models for trajectory prediction with labels corresponding to the roles of the people. To this end we use the novel THÖR-Magni dataset, which captures human activity in industrial settings and includes the relevant semantic labels for people who navigate complex environments, interact with objects and robots, work alone and in groups. In qualitative and quantitative experiments we show that the role-conditioned LSTM, Transformer, GAN and VAE methods can effectively incorporate the semantic categories, better capture the underlying input distribution and therefore produce more accurate motion predictions in terms of Top-K ADE/FDE and log-likelihood metrics.
Cite
Text
de Almeida et al. "THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Mtion Prediction." IEEE/CVF International Conference on Computer Vision Workshops, 2023. doi:10.1109/ICCVW60793.2023.00234Markdown
[de Almeida et al. "THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Mtion Prediction." IEEE/CVF International Conference on Computer Vision Workshops, 2023.](https://mlanthology.org/iccvw/2023/dealmeida2023iccvw-thormagni/) doi:10.1109/ICCVW60793.2023.00234BibTeX
@inproceedings{dealmeida2023iccvw-thormagni,
title = {{THÖR-Magni: Comparative Analysis of Deep Learning Models for Role-Conditioned Human Mtion Prediction}},
author = {de Almeida, Tiago Rodrigues and Rudenko, Andrey and Schreiter, Tim and Zhu, Yufei and Gutiérrez-Maestro, Eduardo and Morillo-Méndez, Lucas and Kucner, Tomasz Piotr and Mozos, Óscar Martínez and Magnusson, Martin and Palmieri, Luigi and Arras, Kai O. and Lilienthal, Achim J.},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2023},
pages = {2192-2201},
doi = {10.1109/ICCVW60793.2023.00234},
url = {https://mlanthology.org/iccvw/2023/dealmeida2023iccvw-thormagni/}
}