Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning

Abstract

Long-horizon goal-conditioned tasks pose fundamental challenges for reinforcement learning (RL), particularly when goals are distant and rewards are sparse. While hierarchical and graph-based methods offer partial solutions, their reliance on conventional hindsight relabeling often fails to correct subgoal infeasibility, leading to inefficient high-level planning. To address this, we propose Strict Subgoal Execution (SSE), a graph-based hierarchical RL framework that integrates Frontier Experience Replay (FER) to separate unreachable from admissible subgoals and streamline high-level decision making. FER delineates the reachability frontier using failure and partial-success transitions, which identifies unreliable subgoals, increases subgoal reliability, and reduces unnecessary high-level decisions. Additionally, SSE employs a decoupled exploration policy to cover underexplored regions of the goal space and a path refinement that adjusts edge costs using observed low-level failures. Experimental results across diverse long-horizon benchmarks show that SSE consistently outperforms existing goal-conditioned and hierarchical RL methods in both efficiency and success rate.

Cite

Text

Han et al. "Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning." International Conference on Learning Representations, 2026.

Markdown

[Han et al. "Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/han2026iclr-strict/)

BibTeX

@inproceedings{han2026iclr-strict,
  title     = {{Strict Subgoal Execution: Reliable Long-Horizon Planning in Hierarchical Reinforcement Learning}},
  author    = {Han, Seungyul and Hwang, Jaebak and Lee, Sanghyeon and Kim, Jeongmo},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/han2026iclr-strict/}
}