Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Abstract
We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents. We first identify two main challenges of learning such policies: 1) the indistinguishability of tasks from the state distribution, due to the vast scene diversity, and 2) the non-stationary nature of environment dynamics caused by the partial observability. To tackle the first challenge, we propose Goal-Sensitive Backbone (GSB) for the policy to encourage the emergence of goal-relevant visual state representations. To tackle the second challenge, the policy is further fueled by an adaptive horizon prediction module that helps alleviate the learning uncertainty brought by the non-stationary dynamics. Experiments on 20 Minecraft tasks show that our method significantly outperforms the best baseline so far; in many of them, we double the performance. Our ablation and exploratory studies then explain how our approach beat the counterparts and also unveil the surprising bonus of zero-shot generalization to new scenes (biomes). We hope our agent could help shed some light on learning goal-conditioned, multi-task agents in challenging, open-ended environments like Minecraft.
Cite
Text
Cai et al. "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.01320Markdown
[Cai et al. "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/cai2023cvpr-openworld/) doi:10.1109/CVPR52729.2023.01320BibTeX
@inproceedings{cai2023cvpr-openworld,
title = {{Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction}},
author = {Cai, Shaofei and Wang, Zihao and Ma, Xiaojian and Liu, Anji and Liang, Yitao},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {13734-13744},
doi = {10.1109/CVPR52729.2023.01320},
url = {https://mlanthology.org/cvpr/2023/cai2023cvpr-openworld/}
}