Modeling Lexical Cohesion for Document-Level Machine Translation

Abstract

Lexical cohesion arises from a chain of lexical items that establish links between sentences in a text. In this paper we propose three different models to capture lexical cohesion for document-level machine translation: (a) a direct reward model where translation hypotheses are rewarded whenever lexical cohesion devices occur in them, (b) a conditional probability model where the appropriateness of using lexical cohesion devices is measured, and (c) a mutual information trigger model where a lexical cohesion relation is considered as a trigger pair and the strength of the association between the trigger and the triggered item is estimated by mutual information. We integrate the three models into hierarchical phrase-based machine translation and evaluate their effectiveness on the NIST Chinese-English translation tasks with large-scale training data. Experiment results show that all three models can achieve substantial improvements over the baseline and that the mutual information trigger model performs better than the others.

Cite

Text

Xiong et al. "Modeling Lexical Cohesion for Document-Level Machine Translation." International Joint Conference on Artificial Intelligence, 2013.

Markdown

[Xiong et al. "Modeling Lexical Cohesion for Document-Level Machine Translation." International Joint Conference on Artificial Intelligence, 2013.](https://mlanthology.org/ijcai/2013/xiong2013ijcai-modeling/)

BibTeX

@inproceedings{xiong2013ijcai-modeling,
  title     = {{Modeling Lexical Cohesion for Document-Level Machine Translation}},
  author    = {Xiong, Deyi and Ben, Guosheng and Zhang, Min and Lv, Yajuan and Liu, Qun},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2013},
  pages     = {2183-2189},
  url       = {https://mlanthology.org/ijcai/2013/xiong2013ijcai-modeling/}
}