Effective Graph Context Representation for Document-Level Machine Translation

Abstract

Document-level neural machine translation (DocNMT) universally encodes several local sentences or the entire document. Thus, DocNMT does not consider the relevance of document-level contextual information, for example, some context (i.e., content words, logical order, and co-occurrence relation) is more effective than another auxiliary context (i.e., functional and auxiliary words). To address this issue, we first utilize the word frequency information to recognize content words in the input document, and then use heuristical relations to summarize content words and sentences as a graph structure without relying on external syntactic knowledge. Furthermore, we apply graph attention networks to this graph structure to learn its feature representation, which allows DocNMT to more effectively capture the document-level context. Experimental results on several widely-used document-level benchmarks demonstrated the effectiveness of the proposed approach.

Cite

Text

Chen et al. "Effective Graph Context Representation for Document-Level Machine Translation." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/566

Markdown

[Chen et al. "Effective Graph Context Representation for Document-Level Machine Translation." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/chen2022ijcai-effective/) doi:10.24963/IJCAI.2022/566

BibTeX

@inproceedings{chen2022ijcai-effective,
  title     = {{Effective Graph Context Representation for Document-Level Machine Translation}},
  author    = {Chen, Kehai and Yang, Muyun and Utiyama, Masao and Sumita, Eiichiro and Wang, Rui and Zhang, Min},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {4079-4085},
  doi       = {10.24963/IJCAI.2022/566},
  url       = {https://mlanthology.org/ijcai/2022/chen2022ijcai-effective/}
}