Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction
Abstract
Reinforcement learning trains policies specialized for a single task. Meta-reinforcement learning (meta-RL) improves upon this by leveraging prior experience to train policies for few-shot adaptation to new tasks. However, existing meta-RL approaches often struggle to explore and learn tasks effectively. We introduce a novel meta-RL algorithm that learns to learn task-specific exploration policies for sample-efficient few-shot adaptation. We achieve this through task reconstruction, an original method for learning to identify and collect small but informative datasets from tasks. To leverage these datasets, we also propose learning a meta-reward that encourages policies to learn to adapt. Empirical evaluations demonstrate that our algorithm achieves higher returns than existing meta-RL methods. Additionally, we show that even with full task information, adaptation is more challenging than previously assumed. However, policies trained with our meta-reward adapt to new tasks successfully.
Cite
Text
Stoican et al. "Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction." Transactions on Machine Learning Research, 2026.Markdown
[Stoican et al. "Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction." Transactions on Machine Learning Research, 2026.](https://mlanthology.org/tmlr/2026/stoican2026tmlr-taskspecific/)BibTeX
@article{stoican2026tmlr-taskspecific,
title = {{Task-Specific Exploration in Meta-Reinforcement Learning via Task Reconstruction}},
author = {Stoican, Radu and Cangelosi, Angelo and Goerick, Christian and Weisswange, Thomas H},
journal = {Transactions on Machine Learning Research},
year = {2026},
url = {https://mlanthology.org/tmlr/2026/stoican2026tmlr-taskspecific/}
}