Multiway Attention Networks for Modeling Sentence Pairs

Abstract

Modeling sentence pairs plays the vital role for judging the relationship between two sentences, such as paraphrase identification, natural language inference, and answer sentence selection. Previous work achieves very promising results using neural networks with attention mechanism. In this paper, we propose the multiway attention networks which employ multiple attention functions to match sentence pairs under the matching-aggregation framework. Specifically, we design four attention functions to match words in corresponding sentences. Then, we aggregate the matching information from each function, and combine the information from all functions to obtain the final representation. Experimental results demonstrate that the proposed multiway attention networks improve the result on the Quora Question Pairs, SNLI, MultiNLI, and answer sentence selection task on the SQuAD dataset.

Cite

Text

Tan et al. "Multiway Attention Networks for Modeling Sentence Pairs." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/613

Markdown

[Tan et al. "Multiway Attention Networks for Modeling Sentence Pairs." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/tan2018ijcai-multiway/) doi:10.24963/IJCAI.2018/613

BibTeX

@inproceedings{tan2018ijcai-multiway,
  title     = {{Multiway Attention Networks for Modeling Sentence Pairs}},
  author    = {Tan, Chuanqi and Wei, Furu and Wang, Wenhui and Lv, Weifeng and Zhou, Ming},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {4411-4417},
  doi       = {10.24963/IJCAI.2018/613},
  url       = {https://mlanthology.org/ijcai/2018/tan2018ijcai-multiway/}
}