Generalized Value Iteration Networks: Life Beyond Lattices

Abstract

In this paper, we introduce a generalized value iteration network (GVIN), which is an end-to-end neural network planning module. GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs. We propose three novel differentiable kernels as graph convolution operators and show that the embedding-based kernel achieves the best performance. Furthermore, we present episodic Q-learning, an improvement upon traditional n-step Q-learning that stabilizes training for VIN and GVIN. Lastly, we evaluate GVIN on planning problems in 2D mazes, irregular graphs, and real-world street networks, showing that GVIN generalizes well for both arbitrary graphs and unseen graphs of larger scaleand outperforms a naive generalization of VIN (discretizing a spatial graph into a 2D image).

Cite

Text

Niu et al. "Generalized Value Iteration Networks: Life Beyond Lattices." AAAI Conference on Artificial Intelligence, 2018. doi:10.1609/AAAI.V32I1.12081

Markdown

[Niu et al. "Generalized Value Iteration Networks: Life Beyond Lattices." AAAI Conference on Artificial Intelligence, 2018.](https://mlanthology.org/aaai/2018/niu2018aaai-generalized/) doi:10.1609/AAAI.V32I1.12081

BibTeX

@inproceedings{niu2018aaai-generalized,
  title     = {{Generalized Value Iteration Networks: Life Beyond Lattices}},
  author    = {Niu, Sufeng and Chen, Siheng and Guo, Hanyu and Targonski, Colin and Smith, Melissa C. and Kovacevic, Jelena},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {6246-6253},
  doi       = {10.1609/AAAI.V32I1.12081},
  url       = {https://mlanthology.org/aaai/2018/niu2018aaai-generalized/}
}