Word Sense Disambiguation for All Words Without Hard Labor
Abstract
While the most accurate word sense disambiguation systems are built using supervised learning from sense-tagged data, scaling them up to all words of a language has proved elusive, since preparing a sense-tagged corpus for all words of a language is time-consuming and human labor intensive. In this paper, we propose and implement a completely automatic approach to scale up word sense disambiguation to all words of English. Our approach relies on English-Chinese parallel corpora, English-Chinese bilingual dictionaries, and automatic methods of finding synonyms of Chinese words. No additional human sense annotations or word translations are needed. We conducted a large-scale empirical evaluation on more than 29,000 noun tokens in English texts annotated in OntoNotes 2.0, based on its coarse-grained sense inventory. The evaluation results show that our approach is able to achieve high accuracy, outperforming the first-sense baseline and coming close to a prior reported approach that requires manual human efforts to provide Chinese translations of English senses. Zhi Zhong, Hwee Tou Ng
Cite
Text
Zhong and Ng. "Word Sense Disambiguation for All Words Without Hard Labor." International Joint Conference on Artificial Intelligence, 2009.Markdown
[Zhong and Ng. "Word Sense Disambiguation for All Words Without Hard Labor." International Joint Conference on Artificial Intelligence, 2009.](https://mlanthology.org/ijcai/2009/zhong2009ijcai-word/)BibTeX
@inproceedings{zhong2009ijcai-word,
title = {{Word Sense Disambiguation for All Words Without Hard Labor}},
author = {Zhong, Zhi and Ng, Hwee Tou},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2009},
pages = {1616-1622},
url = {https://mlanthology.org/ijcai/2009/zhong2009ijcai-word/}
}