Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning

Han, Seungyul; Lee, Sanghyeon; Bae, Sangjun; Park, Yisak

Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning

Seungyul Han, Sanghyeon Lee, Sangjun Bae, Yisak Park

ICLR 2026

/iclr/2026/han2026iclr-selfimproving/

Abstract

Meta-reinforcement learning (Meta-RL) facilitates rapid adaptation to unseen tasks but faces challenges in long-horizon environments. Skill-based approaches tackle this by decomposing state-action sequences into reusable skills and employing hierarchical decision-making. However, these methods are highly susceptible to noisy offline demonstrations, leading to unstable skill learning and degraded performance. To address this, we propose Self-Improving Skill Learning (SISL), which performs self-guided skill refinement using decoupled high-level and skill improvement policies, while applying skill prioritization via maximum return relabeling to focus updates on task-relevant trajectories, resulting in robust and stable adaptation even under noisy and suboptimal data. By mitigating the effect of noise, SISL achieves reliable skill learning and consistently outperforms other skill-based meta-RL methods on diverse long-horizon tasks. Our code is available at https://github.com/epsilog/SISL.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Han et al. "Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning." International Conference on Learning Representations, 2026.

Markdown

[Han et al. "Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/han2026iclr-selfimproving/)

BibTeX

@inproceedings{han2026iclr-selfimproving,
  title     = {{Self-Improving Skill Learning for Robust Skill-Based Meta-Reinforcement Learning}},
  author    = {Han, Seungyul and Lee, Sanghyeon and Bae, Sangjun and Park, Yisak},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/han2026iclr-selfimproving/}
}