Improving Agent Behaviors with RL Fine-Tuning for Autonomous Driving
Abstract
A major challenge in autonomous vehicle research is modeling agent behaviors, which has critical applications including constructing realistic and reliable simulations for off-board evaluation and forecasting traffic agents motion for onboard planning. While supervised learning has shown success in modeling agents across various domains, these models can suffer from distribution shift when deployed at test-time. In this work, we improve the reliability of agent behaviors by closed-loop fine-tuning of behavior models with reinforcement learning. Our method demonstrates improved overall performance, as well as targeted metrics such as collision rate, on the Waymo Open Sim Agents challenge. Additionally, we present a novel policy evaluation benchmark to directly assess the ability of simulated agents to measure quality of autonomous vehicle planners and demonstrate the effectiveness of our approach on this new benchmark. all_papers.txt decode_tex_noligatures.sh decode_tex_noligatures.sh~ decode_tex.sh decode_tex.sh~ ECCV_abstracts.csv ECCV_abstracts_good.csv ECCV.csv ECCV.csv~ ECCV_new.csv generate_list.sh generate_list.sh~ generate_overview.sh gen.sh gen.sh~ HOWTO HOWTO~ pdflist pdflist.copied RCS snippet.html Work done as an intern at Waymo.
Cite
Text
Peng et al. "Improving Agent Behaviors with RL Fine-Tuning for Autonomous Driving." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-72698-9_10Markdown
[Peng et al. "Improving Agent Behaviors with RL Fine-Tuning for Autonomous Driving." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/peng2024eccv-improving/) doi:10.1007/978-3-031-72698-9_10BibTeX
@inproceedings{peng2024eccv-improving,
title = {{Improving Agent Behaviors with RL Fine-Tuning for Autonomous Driving}},
author = {Peng, Zhenghao and Luo, Wenjie and Lu, Yiren and Shen, Tianyi and Gulino, Cole and Seff, Ari and Fu, Justin},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2024},
doi = {10.1007/978-3-031-72698-9_10},
url = {https://mlanthology.org/eccv/2024/peng2024eccv-improving/}
}