CarPlanner: Consistent Auto-Regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving

Abstract

Trajectory planning is vital for autonomous driving, ensuring safe and efficient navigation in complex environments. While recent learning-based methods, particularly reinforcement learning (RL), have shown promise in specific scenarios, RL planners struggle with training inefficiencies and managing large-scale, real-world driving scenarios.In this paper, we introduce CarPlanner, a Consistent auto-regressive Planner that uses RL to generate multi-modal trajectories. The auto-regressive structure enables efficient large-scale RL training, while the incorporation of consistency ensures stable policy learning by maintaining coherent temporal consistency across time steps. Moreover, CarPlanner employs a generation-selection framework with an expert-guided reward function and an invariant-view module, simplifying RL training and enhancing policy performance.Extensive analysis demonstrates that our proposed RL framework effectively addresses the challenges of training efficiency and performance enhancement, positioning CarPlanner as a promising solution for trajectory planning in autonomous driving.To the best of our knowledge, we are the first to demonstrate that the RL-based planner can surpass both IL- and rule-based state-of-the-arts (SOTAs) on the challenging large-scale real-world dataset nuPlan. Our proposed CarPlanner surpasses RL-, IL-, and rule-based SOTA approaches within this demanding dataset.

Cite

Text

Zhang et al. "CarPlanner: Consistent Auto-Regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01607

Markdown

[Zhang et al. "CarPlanner: Consistent Auto-Regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/zhang2025cvpr-carplanner/) doi:10.1109/CVPR52734.2025.01607

BibTeX

@inproceedings{zhang2025cvpr-carplanner,
  title     = {{CarPlanner: Consistent Auto-Regressive Trajectory Planning for Large-Scale Reinforcement Learning in Autonomous Driving}},
  author    = {Zhang, Dongkun and Liang, Jiaming and Guo, Ke and Lu, Sha and Wang, Qi and Xiong, Rong and Miao, Zhenwei and Wang, Yue},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {17239-17248},
  doi       = {10.1109/CVPR52734.2025.01607},
  url       = {https://mlanthology.org/cvpr/2025/zhang2025cvpr-carplanner/}
}