VLP: Vision Language Planning for Autonomous Driving

Abstract

Autonomous driving is a complex and challenging task that aims at safe motion planning through scene understanding and reasoning. While vision-only autonomous driving methods have recently achieved notable performance through enhanced scene understanding several key issues including lack of reasoning low generalization performance and long-tail scenarios still need to be addressed. In this paper we present VLP a novel Vision-Language-Planning framework that exploits language models to bridge the gap between linguistic understanding and autonomous driving. VLP enhances autonomous driving systems by strengthening both the source memory foundation and the self-driving car's contextual understanding. VLP achieves state-of-the-art end-to-end planning performance on the challenging NuScenes dataset by achieving 35.9% and 60.5% reduction in terms of average L2 error and collision rates respectively compared to the previous best method. Moreover VLP shows improved performance in challenging long-tail scenarios and strong generalization capabilities when faced with new urban environments.

Cite

Text

Pan et al. "VLP: Vision Language Planning for Autonomous Driving." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01398

Markdown

[Pan et al. "VLP: Vision Language Planning for Autonomous Driving." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/pan2024cvpr-vlp/) doi:10.1109/CVPR52733.2024.01398

BibTeX

@inproceedings{pan2024cvpr-vlp,
  title     = {{VLP: Vision Language Planning for Autonomous Driving}},
  author    = {Pan, Chenbin and Yaman, Burhaneddin and Nesti, Tommaso and Mallik, Abhirup and Allievi, Alessandro G and Velipasalar, Senem and Ren, Liu},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {14760-14769},
  doi       = {10.1109/CVPR52733.2024.01398},
  url       = {https://mlanthology.org/cvpr/2024/pan2024cvpr-vlp/}
}