Word Sense Disambiguation for All Words Without Hard Labor

Zhong, Zhi; Ng, Hwee Tou

Word Sense Disambiguation for All Words Without Hard Labor

IJCAI 2009 pp. 1616-1622

/ijcai/2009/zhong2009ijcai-word/

Abstract

While the most accurate word sense disambiguation systems are built using supervised learning from sense-tagged data, scaling them up to all words of a language has proved elusive, since preparing a sense-tagged corpus for all words of a language is time-consuming and human labor intensive. In this paper, we propose and implement a completely automatic approach to scale up word sense disambiguation to all words of English. Our approach relies on English-Chinese parallel corpora, English-Chinese bilingual dictionaries, and automatic methods of finding synonyms of Chinese words. No additional human sense annotations or word translations are needed. We conducted a large-scale empirical evaluation on more than 29,000 noun tokens in English texts annotated in OntoNotes 2.0, based on its coarse-grained sense inventory. The evaluation results show that our approach is able to achieve high accuracy, outperforming the first-sense baseline and coming close to a prior reported approach that requires manual human efforts to provide Chinese translations of English senses. Zhi Zhong, Hwee Tou Ng

PDF Semantic Scholar

Cite

Text

Zhong and Ng. "Word Sense Disambiguation for All Words Without Hard Labor." International Joint Conference on Artificial Intelligence, 2009.

Markdown

[Zhong and Ng. "Word Sense Disambiguation for All Words Without Hard Labor." International Joint Conference on Artificial Intelligence, 2009.](https://mlanthology.org/ijcai/2009/zhong2009ijcai-word/)

BibTeX

@inproceedings{zhong2009ijcai-word,
  title     = {{Word Sense Disambiguation for All Words Without Hard Labor}},
  author    = {Zhong, Zhi and Ng, Hwee Tou},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2009},
  pages     = {1616-1622},
  url       = {https://mlanthology.org/ijcai/2009/zhong2009ijcai-word/}
}