Enhancing Bilingual Lexicon Induction via Bi-Directional Translation Pair Retrieving

Abstract

Most Bilingual Lexicon Induction (BLI) methods retrieve word translation pairs by finding the closest target word for a given source word based on cross-lingual word embeddings (WEs). However, we find that solely retrieving translation from the source-to-target perspective leads to some false positive translation pairs, which significantly harm the precision of BLI. To address this problem, we propose a novel and effective method to improve translation pair retrieval in cross-lingual WEs. Specifically, we consider both source-side and target-side perspectives throughout the retrieval process to alleviate false positive word pairings that emanate from a single perspective. On a benchmark dataset of BLI, our proposed method achieves competitive performance compared to existing state-of-the-art (SOTA) methods. It demonstrates effectiveness and robustness across six experimental languages, including similar language pairs and distant language pairs, under both supervised and unsupervised settings.

Cite

Text

Ding et al. "Enhancing Bilingual Lexicon Induction via Bi-Directional Translation Pair Retrieving." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29744

Markdown

[Ding et al. "Enhancing Bilingual Lexicon Induction via Bi-Directional Translation Pair Retrieving." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/ding2024aaai-enhancing/) doi:10.1609/AAAI.V38I16.29744

BibTeX

@inproceedings{ding2024aaai-enhancing,
  title     = {{Enhancing Bilingual Lexicon Induction via Bi-Directional Translation Pair Retrieving}},
  author    = {Ding, Qiuyu and Cao, Hailong and Zhao, Tiejun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {17898-17906},
  doi       = {10.1609/AAAI.V38I16.29744},
  url       = {https://mlanthology.org/aaai/2024/ding2024aaai-enhancing/}
}