HyperAdam: A Learnable Task-Adaptive Adam for Network Training
Abstract
Deep neural networks are traditionally trained using human-designed stochastic optimization algorithms, such as SGD and Adam. Recently, the approach of learning to optimize network parameters has emerged as a promising research topic. However, these learned black-box optimizers sometimes do not fully utilize the experience embedded in human-designed optimizers and therefore have limited generalization ability. In this paper, a new optimizer, dubbed HyperAdam, is proposed that combines the idea of “learning to optimize” with the traditional Adam optimizer. Given a network to train, the parameter update generated by HyperAdam at each iteration is an adaptive combination of multiple updates generated by Adam with varying decay rates. The combination weights and decay rates in HyperAdam are adaptively learned depending on the task. HyperAdam is modeled as a recurrent neural network with AdamCell, WeightCell and StateCell. It is validated to achieve state-of-the-art performance for training various networks, such as multilayer perceptrons, CNNs and LSTMs.
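The core mechanism described above can be illustrated with a minimal sketch (not the authors' code): compute several Adam-style candidate updates with different decay rates and combine them with adaptive weights. The decay-rate grid and the fixed softmax weights below are illustrative placeholders; in the paper both are produced by learned recurrent cells (AdamCell, WeightCell, StateCell).

```python
# Minimal sketch of HyperAdam's core idea, assuming a hand-picked grid of decay
# rates and a stand-in weight vector instead of the learned recurrent cells.
import numpy as np

def adam_candidate_updates(grad, m, v, t, betas1, betas2, lr=0.05, eps=1e-8):
    """Return one bias-corrected Adam update per (beta1, beta2) pair."""
    updates = []
    for k, (b1, b2) in enumerate(zip(betas1, betas2)):
        m[k] = b1 * m[k] + (1 - b1) * grad          # first-moment estimate
        v[k] = b2 * v[k] + (1 - b2) * grad ** 2     # second-moment estimate
        m_hat = m[k] / (1 - b1 ** t)                # bias correction
        v_hat = v[k] / (1 - b2 ** t)
        updates.append(-lr * m_hat / (np.sqrt(v_hat) + eps))
    return updates

def combined_update(updates, weights):
    """Adaptive combination of candidate updates (weights would be learned)."""
    weights = np.exp(weights) / np.sum(np.exp(weights))   # softmax over candidates
    return sum(w * u for w, u in zip(weights, updates))

# Toy usage: minimize f(x) = ||x||^2 with a fixed (uniform) weight vector.
if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=5)
    betas1, betas2 = [0.5, 0.9, 0.99], [0.9, 0.999, 0.9999]   # illustrative grid
    m = [np.zeros_like(x) for _ in betas1]
    v = [np.zeros_like(x) for _ in betas1]
    weights = np.zeros(len(betas1))                            # stand-in for learned weights
    for t in range(1, 301):
        grad = 2 * x
        x = x + combined_update(
            adam_candidate_updates(grad, m, v, t, betas1, betas2), weights)
    print("final loss:", float(np.sum(x ** 2)))
```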
Cite
Text
Wang et al. "HyperAdam: A Learnable Task-Adaptive Adam for Network Training." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33015297
Markdown
[Wang et al. "HyperAdam: A Learnable Task-Adaptive Adam for Network Training." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/wang2019aaai-hyperadam/) doi:10.1609/AAAI.V33I01.33015297
BibTeX
@inproceedings{wang2019aaai-hyperadam,
title = {{HyperAdam: A Learnable Task-Adaptive Adam for Network Training}},
author = {Wang, Shipeng and Sun, Jian and Xu, Zongben},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2019},
pages = {5297--5304},
doi = {10.1609/AAAI.V33I01.33015297},
url = {https://mlanthology.org/aaai/2019/wang2019aaai-hyperadam/}
}