Continual Visual Reinforcement Learning with a Life-Long World Model
Abstract
Learning physical dynamics in a series of non-stationary environments is a challenging but essential task for model-based reinforcement learning (MBRL) with visual inputs. It requires the agent to consistently adapt to novel tasks without forgetting previous knowledge. In this paper, we present a new continual learning approach for visual dynamics modeling and explore its efficacy in visual control. The key assumption is that an ideal world model can provide a non-forgetting environment simulator, which enables the agent to optimize the policy in a multi-task learning manner based on the imagined trajectories from the world model. To this end, we first introduce the life-long world model, which learns task-specific latent dynamics using a mixture of Gaussians and incorporates generative experience replay to mitigate catastrophic forgetting. Then, we further address the value estimation challenge for previous tasks with the exploratory-conservative behavior learning approach. Our model remarkably outperforms the straightforward combinations of existing continual learning and visual RL algorithms on DeepMind Control Suite and Meta-World benchmarks with continual visual control tasks.
Cite
Text
Pan et al. "Continual Visual Reinforcement Learning with a Life-Long World Model." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06106-5_9Markdown
[Pan et al. "Continual Visual Reinforcement Learning with a Life-Long World Model." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/pan2025ecmlpkdd-continual/) doi:10.1007/978-3-032-06106-5_9BibTeX
@inproceedings{pan2025ecmlpkdd-continual,
title = {{Continual Visual Reinforcement Learning with a Life-Long World Model}},
author = {Pan, Minting and Zhang, Wendong and Chen, Geng and Zhu, Xiangming and Gao, Siyu and Wang, Yunbo and Yang, Xiaokang},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2025},
pages = {146-162},
doi = {10.1007/978-3-032-06106-5_9},
url = {https://mlanthology.org/ecmlpkdd/2025/pan2025ecmlpkdd-continual/}
}