Agreement on Target-Bidirectional LSTMs for Sequence-to-Sequence Learning

Abstract

Recurrent neural networks, particularly the long short- term memory networks, are extremely appealing for sequence-to-sequence learning tasks. Despite their great success, they typically suffer from a fundamental short- coming: they are prone to generate unbalanced targets with good prefixes but bad suffixes, and thus perfor- mance suffers when dealing with long sequences. We propose a simple yet effective approach to overcome this shortcoming. Our approach relies on the agreement between a pair of target-directional LSTMs, which generates more balanced targets. In addition, we develop two efficient approximate search methods for agreement that are empirically shown to be almost optimal in terms of sequence-level losses. Extensive experiments were performed on two standard sequence-to-sequence trans- duction tasks: machine transliteration and grapheme-to- phoneme transformation. The results show that the proposed approach achieves consistent and substantial im- provements, compared to six state-of-the-art systems. In particular, our approach outperforms the best reported error rates by a margin (up to 9% relative gains) on the grapheme-to-phoneme task.

Cite

Text

Liu et al. "Agreement on Target-Bidirectional LSTMs for Sequence-to-Sequence Learning." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10327

Markdown

[Liu et al. "Agreement on Target-Bidirectional LSTMs for Sequence-to-Sequence Learning." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/liu2016aaai-agreement/) doi:10.1609/AAAI.V30I1.10327

BibTeX

@inproceedings{liu2016aaai-agreement,
  title     = {{Agreement on Target-Bidirectional LSTMs for Sequence-to-Sequence Learning}},
  author    = {Liu, Lemao and Finch, Andrew M. and Utiyama, Masao and Sumita, Eiichiro},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {2630-2637},
  doi       = {10.1609/AAAI.V30I1.10327},
  url       = {https://mlanthology.org/aaai/2016/liu2016aaai-agreement/}
}