End-to-End Interpretable Neural Motion Planner

Abstract

In this paper, we propose a neural motion planner for learning to drive autonomously in complex urban scenarios that include traffic-light handling, yielding, and interactions with multiple road-users. Towards this goal, we design a holistic model that takes as input raw LIDAR data and a HD map and produces interpretable intermediate representations in the form of 3D detections and their future trajectories, as well as a cost volume defining the goodness of each position that the self-driving car can take within the planning horizon. We then sample a set of diverse physically possible trajectories and choose the one with the minimum learned cost. Importantly, our cost volume is able to naturally capture multi-modality. We demonstrate the effectiveness of our approach in real-world driving data captured in several cities in North America. Our experiments show that the learned cost volume can generate safer planning than all the baselines.

Cite

Text

Zeng et al. "End-to-End Interpretable Neural Motion Planner." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00886

Markdown

[Zeng et al. "End-to-End Interpretable Neural Motion Planner." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/zeng2019cvpr-endtoend/) doi:10.1109/CVPR.2019.00886

BibTeX

@inproceedings{zeng2019cvpr-endtoend,
  title     = {{End-to-End Interpretable Neural Motion Planner}},
  author    = {Zeng, Wenyuan and Luo, Wenjie and Suo, Simon and Sadat, Abbas and Yang, Bin and Casas, Sergio and Urtasun, Raquel},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00886},
  url       = {https://mlanthology.org/cvpr/2019/zeng2019cvpr-endtoend/}
}