Mega-Reward: Achieving Human-Level Play Without Extrinsic Rewards
Abstract
Intrinsic rewards were introduced to simulate how human intelligence works; they are usually evaluated by intrinsically-motivated play, i.e., playing games without extrinsic rewards but evaluated with extrinsic rewards. However, none of the existing intrinsic reward approaches can achieve human-level performance under this very challenging setting of intrinsically-motivated play. In this work, we propose a novel megalomania-driven intrinsic reward (called mega-reward), which, to our knowledge, is the first approach that achieves human-level performance in intrinsically-motivated play. Intuitively, mega-reward comes from the observation that infants' intelligence develops when they try to gain more control on entities in an environment; therefore, mega-reward aims to maximize the control capabilities of agents on given entities in a given environment. To formalize mega-reward, a relational transition model is proposed to bridge the gaps between direct and latent control. Experimental studies show that mega-reward (i) can greatly outperform all state-of-the-art intrinsic reward approaches, (ii) generally achieves the same level of performance as Ex-PPO and professional human-level scores, and (iii) has also a superior performance when it is incorporated with extrinsic rewards.
Cite
Text
Song et al. "Mega-Reward: Achieving Human-Level Play Without Extrinsic Rewards." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.6040Markdown
[Song et al. "Mega-Reward: Achieving Human-Level Play Without Extrinsic Rewards." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/song2020aaai-mega/) doi:10.1609/AAAI.V34I04.6040BibTeX
@inproceedings{song2020aaai-mega,
title = {{Mega-Reward: Achieving Human-Level Play Without Extrinsic Rewards}},
author = {Song, Yuhang and Wang, Jianyi and Lukasiewicz, Thomas and Xu, Zhenghua and Zhang, Shangtong and Wojcicki, Andrzej and Xu, Mai},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2020},
pages = {5826-5833},
doi = {10.1609/AAAI.V34I04.6040},
url = {https://mlanthology.org/aaai/2020/song2020aaai-mega/}
}