Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving

Abstract

Reinforcement learning (RL) has demonstrated significant potential in autonomous driving. However, its low sample efficiency and heavy reliance on environmental feedback pose challenges in handling highly dynamic and uncertain traffic scenarios, especially in pixel-based settings. In this paper, we propose a two-stage continual RL framework with implicit generative replay to address dynamic requirements in complex traffic situations. Based on the shared structures with parameterized skills and abstract transitions, we design a latent fine-tuning mechanism that facilitates steady policy improvement and avoids local optima. Additionally, we integrate diffusion-based implicit generative replay to enhance online experiences in latent space, promoting thorough exploration and exploitation. We also implement a dynamically decayed sampling ratio to effectively blend training data, mitigating distribution shifts’ impact and managing time costs. Extensive validation on complex and dynamic driving tasks shows that our approach significantly surpasses previous methods in both learning efficiency and generalization.

Cite

Text

Deng et al. "Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving." European Conference on Computer Vision Workshops, 2024. doi:10.1007/978-3-031-91767-7_4

Markdown

[Deng et al. "Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving." European Conference on Computer Vision Workshops, 2024.](https://mlanthology.org/eccvw/2024/deng2024eccvw-continual/) doi:10.1007/978-3-031-91767-7_4

BibTeX

@inproceedings{deng2024eccvw-continual,
  title     = {{Continual Reinforcement Learning with Implicit Generative Replay for Autonomous Driving}},
  author    = {Deng, Qi and Li, Ruyang and Hu, Qifu and Zhang, Tengfei and Zhang, Heng},
  booktitle = {European Conference on Computer Vision Workshops},
  year      = {2024},
  pages     = {47-64},
  doi       = {10.1007/978-3-031-91767-7_4},
  url       = {https://mlanthology.org/eccvw/2024/deng2024eccvw-continual/}
}