On-Policy Algorithms for Continual Reinforcement Learning (Student Abstract)

Dziarmaga, Tadeusz; Arczewski, Tomasz; Mazur, Marcin; Wolczyk, Maciej

doi:10.1609/AAAI.V39I28.35251

On-Policy Algorithms for Continual Reinforcement Learning (Student Abstract)

Tadeusz Dziarmaga, Tomasz Arczewski, Marcin Mazur, Maciej Wolczyk

AAAI 2025 pp. 29359-29361

doi:10.1609/AAAI.V39I28.35251 /aaai/2025/dziarmaga2025aaai-policy/

Abstract

Continual reinforcement learning (CRL) is the study of optimal strategies for maximizing rewards in sequential environments that change over time. This is particularly crucial in domains such as robotics, where the operational environment is inherently dynamic and subject to continual change. Nevertheless, research in this area has thus far concentrated on off-policy algorithms with replay buffers that are capable of amortizing the impact of distribution shifts. Such an approach is not feasible with on-policy reinforcement learning algorithms that learn solely from the data obtained from the current policy. In this paper, we examine the performance of proximal policy optimization (PPO), a prevalent on-policy reinforcement learning (RL) algorithm, in a classical CRL benchmark. Our findings suggest that the current methods are suboptimal in terms of average performance. Nevertheless, they demonstrate encouraging competitive outcomes with respect to forward transfer and forgetting metrics. This highlights the need for further research into continual on-policy reinforcement learning. The source code is available at https://github.com/Teddy298/continualworld-ppo.

PDF AAAI Semantic Scholar

Cite

Text

Dziarmaga et al. "On-Policy Algorithms for Continual Reinforcement Learning (Student Abstract)." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I28.35251

Markdown

[Dziarmaga et al. "On-Policy Algorithms for Continual Reinforcement Learning (Student Abstract)." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/dziarmaga2025aaai-policy/) doi:10.1609/AAAI.V39I28.35251

BibTeX

@inproceedings{dziarmaga2025aaai-policy,
  title     = {{On-Policy Algorithms for Continual Reinforcement Learning (Student Abstract)}},
  author    = {Dziarmaga, Tadeusz and Arczewski, Tomasz and Mazur, Marcin and Wolczyk, Maciej},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {29359-29361},
  doi       = {10.1609/AAAI.V39I28.35251},
  url       = {https://mlanthology.org/aaai/2025/dziarmaga2025aaai-policy/}
}