SocialVAE: Human Trajectory Prediction Using Timewise Latents

Abstract

Predicting pedestrian movement is critical for human behavior analysis and also for safe and efficient human-agent interactions. However, despite significant advancements, it is still challenging for existing approaches to capture the uncertainty and multimodality of human navigation decision making. In this paper, we propose SocialVAE, a novel approach for human trajectory prediction. The core of SocialVAE is a timewise variational autoencoder architecture that exploits stochastic recurrent neural networks to perform prediction, combined with a social attention mechanism and a backward posterior approximation to allow for better extraction of pedestrian navigation strategies. We show that SocialVAE improves current state-of-the-art performance on several pedestrian trajectory prediction benchmarks, including the ETH/UCY benchmark, Stanford Drone Dataset, and SportVU NBA movement dataset.

Cite

Text

Xu et al. "SocialVAE: Human Trajectory Prediction Using Timewise Latents." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19772-7_30

Markdown

[Xu et al. "SocialVAE: Human Trajectory Prediction Using Timewise Latents." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/xu2022eccv-socialvae/) doi:10.1007/978-3-031-19772-7_30

BibTeX

@inproceedings{xu2022eccv-socialvae,
  title     = {{SocialVAE: Human Trajectory Prediction Using Timewise Latents}},
  author    = {Xu, Pei and Hayet, Jean-Bernard and Karamouzas, Ioannis},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19772-7_30},
  url       = {https://mlanthology.org/eccv/2022/xu2022eccv-socialvae/}
}