JueWu-MC: Playing Minecraft with Sample-Efficient Hierarchical Reinforcement Learning

Abstract

Learning rational behaviors in open-world games like Minecraft remains to be challenging for Reinforcement Learning (RL) research due to the compound challenge of partial observability, high-dimensional visual perception and delayed reward. To address this, we propose JueWu-MC, a sample-efficient hierarchical RL approach equipped with representation learning and imitation learning to deal with perception and exploration. Specifically, our approach includes two levels of hierarchy, where the high-level controller learns a policy to control over options and the low-level workers learn to solve each sub-task. To boost the learning of sub-tasks, we propose a combination of techniques including 1) action-aware representation learning which captures underlying relations between action and representation, 2) discriminator-based self-imitation learning for efficient exploration, and 3) ensemble behavior cloning with consistency filtering for policy robustness. Extensive experiments show that JueWu-MC significantly improves sample efficiency and outperforms a set of baselines by a large margin. Notably, we won the championship of the NeurIPS MineRL 2021 research competition and achieved the highest performance score ever.

Cite

Text

Lin et al. "JueWu-MC: Playing Minecraft with Sample-Efficient Hierarchical Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/452

Markdown

[Lin et al. "JueWu-MC: Playing Minecraft with Sample-Efficient Hierarchical Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/lin2022ijcai-juewu/) doi:10.24963/IJCAI.2022/452

BibTeX

@inproceedings{lin2022ijcai-juewu,
  title     = {{JueWu-MC: Playing Minecraft with Sample-Efficient Hierarchical Reinforcement Learning}},
  author    = {Lin, Zichuan and Li, Junyou and Shi, Jianing and Ye, Deheng and Fu, Qiang and Yang, Wei},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {3257-3263},
  doi       = {10.24963/IJCAI.2022/452},
  url       = {https://mlanthology.org/ijcai/2022/lin2022ijcai-juewu/}
}