DriveGPT: Scaling Autoregressive Behavior Models for Driving
Abstract
We present DriveGPT, a scalable behavior model for autonomous driving. We model driving as a sequential decision-making task, and learn a transformer model to predict future agent states as tokens in an autoregressive fashion. We scale up our model parameters and training data by multiple orders of magnitude, enabling us to explore the scaling properties in terms of dataset size, model parameters, and compute. We evaluate DriveGPT across different scales in a planning task, through both quantitative metrics and qualitative examples, including closed-loop driving in complex real-world scenarios. In a separate prediction task, DriveGPT outperforms state-of-the-art baselines and exhibits improved performance by pretraining on a large-scale dataset, further validating the benefits of data scaling.
Cite
Text
Huang et al. "DriveGPT: Scaling Autoregressive Behavior Models for Driving." Proceedings of the 42nd International Conference on Machine Learning, 2025.
Markdown
[Huang et al. "DriveGPT: Scaling Autoregressive Behavior Models for Driving." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/huang2025icml-drivegpt/)
BibTeX
@inproceedings{huang2025icml-drivegpt,
title = {{DriveGPT: Scaling Autoregressive Behavior Models for Driving}},
author = {Huang, Xin and Wolff, Eric M and Vernaza, Paul and Phan-Minh, Tung and Chen, Hongge and Hayden, David S and Edmonds, Mark and Pierce, Brian and Chen, Xinxin and Jacob, Pratik Elias and Chen, Xiaobai and Tairbekov, Chingiz and Agarwal, Pratik and Gao, Tianshi and Chai, Yuning and Srinivasa, Siddhartha},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {25908--25921},
volume = {267},
url = {https://mlanthology.org/icml/2025/huang2025icml-drivegpt/}
}