Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning
Abstract
Meta-reinforcement learning (Meta-RL) facilitates rapid adaptation to unseen tasks but faces challenges in long-horizon environments. Skill-based approaches tackle this by decomposing state-action sequences into reusable skills and employing hierarchical decision-making. However, these methods are highly susceptible to noisy offline demonstrations, leading to unstable skill learning and degraded performance. To address this, we propose Self-Improving Skill Learning (SISL), which performs self-guided skill refinement using decoupled high-level and skill improvement policies, while applying skill prioritization via maximum return relabeling to focus updates on task-relevant trajectories, resulting in robust and stable adaptation even under noisy and suboptimal data. By mitigating the effect of noise, SISL achieves reliable skill learning and consistently outperforms other skill-based meta-RL methods on diverse long-horizon tasks. Our code is available at https://github.com/epsilog/SISL.
Cite
Text
Han et al. "Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning." International Conference on Learning Representations, 2026.Markdown
[Han et al. "Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/han2026iclr-selfimproving/)BibTeX
@inproceedings{han2026iclr-selfimproving,
title = {{Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning}},
author = {Han, Seungyul and Lee, Sanghyeon and Bae, Sangjun and Park, Yisak},
booktitle = {International Conference on Learning Representations},
year = {2026},
url = {https://mlanthology.org/iclr/2026/han2026iclr-selfimproving/}
}