Learning Unitary Operators with Help from U(n)
Abstract
A major challenge in the training of recurrent neural networks is the so-called vanishing or exploding gradient problem. The use of a norm-preserving transition operator can address this issue, but parametrization is challenging. In this work we focus on unitary operators and describe a parametrization using the Lie algebra u(n) associated with the Lie group U(n) of n × n unitary matrices. The exponential map provides a correspondence between these spaces, and allows us to define a unitary matrix using n2 real coefficients relative to a basis of the Lie algebra. The parametrization is closed under additive updates of these coefficients, and thus provides a simple space in which to do gradient descent. We demonstrate the effectiveness of this parametrization on the problem of learning arbitrary unitary operators, comparing to several baselines and outperforming a recently-proposed lower-dimensional parametrization. We additionally use our parametrization to generalize a recently-proposed unitary recurrent neural network to arbitrary unitary matrices, using it to solve standard long-memory tasks.
Cite
Text
Hyland and Rätsch. "Learning Unitary Operators with Help from U(n)." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10928Markdown
[Hyland and Rätsch. "Learning Unitary Operators with Help from U(n)." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/hyland2017aaai-learning/) doi:10.1609/AAAI.V31I1.10928BibTeX
@inproceedings{hyland2017aaai-learning,
title = {{Learning Unitary Operators with Help from U(n)}},
author = {Hyland, Stephanie L. and Rätsch, Gunnar},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2017},
pages = {2050-2058},
doi = {10.1609/AAAI.V31I1.10928},
url = {https://mlanthology.org/aaai/2017/hyland2017aaai-learning/}
}