Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal

Zhang, Li; Li, Xin; Chen, Sen; Zang, Hongyu; Huang, Jie; Wang, Mingzhong

doi:10.1609/AAAI.V34I04.6157

Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal

Li Zhang, Xin Li, Sen Chen, Hongyu Zang, Jie Huang, Mingzhong Wang

AAAI 2020 pp. 6778-6785

doi:10.1609/AAAI.V34I04.6157 /aaai/2020/zhang2020aaai-universal-a/

Abstract

In this paper, we first formally define the problem set of spatially invariant Markov Decision Processes (MDPs), and show that Value Iteration Networks (VIN) and its extensions are computationally bounded to it due to the use of the convolution kernel. To generalize VIN to spatially variant MDPs, we propose Universal Value Iteration Networks (UVIN). In comparison with VIN, UVIN automatically learns a flexible but compact network structure to encode the transition dynamics of the problems and support the differentiable planning module. We evaluate UVIN with both spatially invariant and spatially variant tasks, including navigation in regular maze, chessboard maze, and Mars, and Minecraft item syntheses. Results show that UVIN can achieve similar performance as VIN and its extensions on spatially invariant tasks, and significantly outperforms other models on more general problems.

PDF AAAI Semantic Scholar

Cite

Text

Zhang et al. "Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.6157

Markdown

[Zhang et al. "Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/zhang2020aaai-universal-a/) doi:10.1609/AAAI.V34I04.6157

BibTeX

@inproceedings{zhang2020aaai-universal-a,
  title     = {{Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal}},
  author    = {Zhang, Li and Li, Xin and Chen, Sen and Zang, Hongyu and Huang, Jie and Wang, Mingzhong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {6778-6785},
  doi       = {10.1609/AAAI.V34I04.6157},
  url       = {https://mlanthology.org/aaai/2020/zhang2020aaai-universal-a/}
}