LiMTR: Time Series Motion Prediction for Diverse Road Users Through Multimodal Feature Integration

Abstract

Predicting the behavior of road users accurately is crucial to enable the safe operation of autonomous vehicles in urban or densely populated areas. Therefore, there has been a growing interest in time series motion prediction research, leading to significant advancements in state-of-the-art techniques in recent years. However, the potential of using LiDAR data to capture more detailed local features, such as a person's gaze or posture, remains largely unexplored. To address this, we develop a novel multimodal approach for motion prediction based on the PointNet foundation model architecture, incorporating local LiDAR features. Evaluation on the Waymo Open Dataset shows a performance improvement of 6.20% and 1.58% in minADE and mAP respectively, when integrated and compared with the previous state-of-the-art MTR. We open-source the code of our LiMTR model.

Cite

Text

Oerlemans et al. "LiMTR: Time Series Motion Prediction for Diverse Road Users Through Multimodal Feature Integration." NeurIPS 2024 Workshops: TSALM, 2024.

Markdown

[Oerlemans et al. "LiMTR: Time Series Motion Prediction for Diverse Road Users Through Multimodal Feature Integration." NeurIPS 2024 Workshops: TSALM, 2024.](https://mlanthology.org/neuripsw/2024/oerlemans2024neuripsw-limtr/)

BibTeX

@inproceedings{oerlemans2024neuripsw-limtr,
  title     = {{LiMTR: Time Series Motion Prediction for Diverse Road Users Through Multimodal Feature Integration}},
  author    = {Oerlemans, Camiel and Grooten, Bram and Braat, Michiel and Alassi, Alaa and Silvas, Emilia and Mocanu, Decebal Constantin},
  booktitle = {NeurIPS 2024 Workshops: TSALM},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/oerlemans2024neuripsw-limtr/}
}