Dueling Network Architectures for Deep Reinforcement Learning

Ziyu Wang, Tom Schaul, Matteo Hessel, Hado Hasselt, Marc Lanctot, Nando Freitas

ICML 2016 pp. 1995-2003

/icml/2016/wang2016icml-dueling/

Abstract

In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state value function and one for the state-dependent action advantage function. The main benefit of this factoring is to generalize learning across actions without imposing any change to the underlying reinforcement learning algorithm. Our results show that this architecture leads to better policy evaluation in the presence of many similar-valued actions. Moreover, the dueling architecture enables our RL agent to outperform the state-of-the-art on the Atari 2600 domain.

PDF ICML Semantic Scholar

Cite

Text

Wang et al. "Dueling Network Architectures for Deep Reinforcement Learning." International Conference on Machine Learning, 2016.

Markdown

[Wang et al. "Dueling Network Architectures for Deep Reinforcement Learning." International Conference on Machine Learning, 2016.](https://mlanthology.org/icml/2016/wang2016icml-dueling/)

BibTeX

@inproceedings{wang2016icml-dueling,
  title     = {{Dueling Network Architectures for Deep Reinforcement Learning}},
  author    = {Wang, Ziyu and Schaul, Tom and Hessel, Matteo and Hasselt, Hado and Lanctot, Marc and Freitas, Nando},
  booktitle = {International Conference on Machine Learning},
  year      = {2016},
  pages     = {1995-2003},
  volume    = {48},
  url       = {https://mlanthology.org/icml/2016/wang2016icml-dueling/}
}