Point Set Registration for Unsupervised Bilingual Lexicon Induction

Cao, Hailong; Zhao, Tiejun

doi:10.24963/IJCAI.2018/555

IJCAI 2018 pp. 3991-3997

Abstract

Inspired by the observation that word embeddings exhibit isomorphic structure across languages, we propose a novel method to induce a bilingual lexicon from only two sets of word embeddings, trained on monolingual source and target data respectively. This is achieved by formulating the task as point set registration, which is a more general problem. We show that a transformation from the source to the target embedding space can be learned automatically without any form of cross-lingual supervision. By properly adapting a traditional point set registration model to make it suitable for processing word embeddings, we achieve state-of-the-art performance on the unsupervised bilingual lexicon induction task. Because the point set registration problem has been well studied and can be solved by many elegant models, this work opens up a new opportunity to capture the universal lexical semantic structure across languages.
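
To make the registration view in the abstract concrete, the following is a minimal sketch of one classic point set registration scheme: an iterative-closest-point style loop that alternates nearest-neighbour matching with an orthogonal Procrustes update. It is an illustration only and is not the model adapted in the paper, whose details the abstract does not give. The function names (`procrustes`, `nearest_targets`, `register`), the toy 2-D points standing in for word embeddings, the rotation angle, and the identity initialisation are all assumptions made for this example.

```python
import numpy as np

def procrustes(X, T):
    """Orthogonal W minimising ||X @ W - T||_F, via SVD of X^T T."""
    u, _, vt = np.linalg.svd(X.T @ T)
    return u @ vt

def nearest_targets(P, Y):
    """Index of the closest row of Y for every row of P (squared Euclidean)."""
    d2 = ((P[:, None, :] - Y[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def register(X, Y, n_iter=10):
    """ICP-style registration (hypothetical helper, no supervision used).

    Starts from the identity map, then alternates matching and refitting
    until the matches stop changing or n_iter is reached.
    """
    W = np.eye(X.shape[1])
    match = nearest_targets(X @ W, Y)
    for _ in range(n_iter):
        W = procrustes(X, Y[match])
        new_match = nearest_targets(X @ W, Y)
        if np.array_equal(new_match, match):
            break
        match = new_match
    return W, match

# Toy "source embeddings": six well-separated 2-D points (assumed data).
X = np.array([[1.0, 0.0], [2.0, 0.0], [0.0, 1.0],
              [0.0, 3.0], [3.0, 2.0], [-2.0, 1.0]])

# Toy "target embeddings": the same points under a small hidden rotation,
# listed in a shuffled order so that no correspondence is given.
theta = 0.1
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
perm = np.array([3, 0, 5, 1, 4, 2])
Y = (X @ R)[perm]

W, match = register(X, Y)
# match[i] is the row of Y paired with source point i: the induced "lexicon".
```

The toy case is deliberately easy: the hidden rotation is small relative to the spacing between points, so the first round of nearest-neighbour matches is already correct. Real embedding spaces are high-dimensional and only approximately isomorphic, and a loop of this kind can settle in a poor local optimum when started from the identity, which is one reason a plain registration model needs adaptation before it suits word embeddings.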

Cite

Text

Cao and Zhao. "Point Set Registration for Unsupervised Bilingual Lexicon Induction." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/555

Markdown

[Cao and Zhao. "Point Set Registration for Unsupervised Bilingual Lexicon Induction." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/cao2018ijcai-point/) doi:10.24963/IJCAI.2018/555

BibTeX

@inproceedings{cao2018ijcai-point,
  title     = {{Point Set Registration for Unsupervised Bilingual Lexicon Induction}},
  author    = {Cao, Hailong and Zhao, Tiejun},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {3991-3997},
  doi       = {10.24963/IJCAI.2018/555},
  url       = {https://mlanthology.org/ijcai/2018/cao2018ijcai-point/}
}