Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction

Stoican, Radu; Cangelosi, Angelo; Goerick, Christian; Weisswange, Thomas H

Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction

Radu Stoican, Angelo Cangelosi, Christian Goerick, Thomas H Weisswange

TMLR 2026

/tmlr/2026/stoican2026tmlr-taskspecific/

Abstract

Reinforcement learning trains policies specialized for a single task. Meta-reinforcement learning (meta-RL) improves upon this by leveraging prior experience to train policies for few-shot adaptation to new tasks. However, existing meta-RL approaches often struggle to explore and learn tasks effectively. We introduce a novel meta-RL algorithm that learns to learn task-specific exploration policies for sample-efficient few-shot adaptation. We achieve this through task reconstruction, an original method for learning to identify and collect small but informative datasets from tasks. To leverage these datasets, we also propose learning a meta-reward that encourages policies to learn to adapt. Empirical evaluations demonstrate that our algorithm achieves higher returns than existing meta-RL methods. Additionally, we show that even with full task information, adaptation is more challenging than previously assumed. However, policies trained with our meta-reward adapt to new tasks successfully.

PDF TMLR OpenReview Code Semantic Scholar

Cite

Text

Stoican et al. "Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction." Transactions on Machine Learning Research, 2026.

Markdown

[Stoican et al. "Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/stoican2026tmlr-taskspecific/)

BibTeX

@article{stoican2026tmlr-taskspecific,
  title     = {{Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction}},
  author    = {Stoican, Radu and Cangelosi, Angelo and Goerick, Christian and Weisswange, Thomas H},
  journal   = {Transactions on Machine Learning Research},
  year      = {2026},
  url       = {https://mlanthology.org/tmlr/2026/stoican2026tmlr-taskspecific/}
}