MotionLM: Multi-Agent Motion Forecasting as Language Modeling

Abstract

Reliable forecasting of the future behavior of road agents is a critical component to safe planning in autonomous vehicles. Here, we represent continuous trajectories as sequences of discrete motion tokens and cast multi-agent motion prediction as a language modeling task over this domain. Our model, MotionLM, provides several advantages: First, it does not require anchors or explicit latent variable optimization to learn multimodal distributions. Instead, we leverage a single standard language modeling objective, maximizing the average log probability over sequence tokens. Second, our approach bypasses post-hoc interaction heuristics where individual agent trajectory generation is conducted prior to interactive scoring. Instead, MotionLM produces joint distributions over interactive agent futures in a single autoregressive decoding process. In addition, the model's sequential factorization enables temporally causal conditional rollouts. The proposed approach establishes new state-of-the-art performance for multi-agent motion prediction on the Waymo Open Motion Dataset, ranking 1st on the interactive challenge leaderboard.

Cite

Text

Seff et al. "MotionLM: Multi-Agent Motion Forecasting as Language Modeling." International Conference on Computer Vision, 2023. doi:10.1109/ICCV51070.2023.00788

Markdown

[Seff et al. "MotionLM: Multi-Agent Motion Forecasting as Language Modeling." International Conference on Computer Vision, 2023.](https://mlanthology.org/iccv/2023/seff2023iccv-motionlm/) doi:10.1109/ICCV51070.2023.00788

BibTeX

@inproceedings{seff2023iccv-motionlm,
  title     = {{MotionLM: Multi-Agent Motion Forecasting as Language Modeling}},
  author    = {Seff, Ari and Cera, Brian and Chen, Dian and Ng, Mason and Zhou, Aurick and Nayakanti, Nigamaa and Refaat, Khaled S. and Al-Rfou, Rami and Sapp, Benjamin},
  booktitle = {International Conference on Computer Vision},
  year      = {2023},
  pages     = {8579-8590},
  doi       = {10.1109/ICCV51070.2023.00788},
  url       = {https://mlanthology.org/iccv/2023/seff2023iccv-motionlm/}
}