Unsupervised Methods for Determining Object and Relation Synonyms on the Web

Abstract

The task of identifying synonymous relations and objects, or synonym resolution, is critical for high-quality information extraction. This paper investigates synonym resolution in the context of unsupervised information extraction, where neither hand-tagged training examples nor domain knowledge is available. The paper presents a scalable, fully-implemented system that runs in O(KN log N) time in the number of extractions, N, and the maximum number of synonyms per word, K. The system, called RESOLVER, introduces a probabilistic relational model for predicting whether two strings are co-referential based on the similarity of the assertions containing them. On a set of two million assertions extracted from the Web, RESOLVER resolves objects with 78% precision and 68% recall, and resolves relations with 90% precision and 35% recall. Several variations of RESOLVER's probabilistic model are explored, and experiments demonstrate that under appropriate conditions these variations can improve F1 by 5%. An extension to the basic RESOLVER system allows it to handle polysemous names with 97% precision and 95% recall on a data set from the TREC corpus.

Cite

Text

Yates and Etzioni. "Unsupervised Methods for Determining Object and Relation Synonyms on the Web." Journal of Artificial Intelligence Research, 2009. doi:10.1613/JAIR.2772

Markdown

[Yates and Etzioni. "Unsupervised Methods for Determining Object and Relation Synonyms on the Web." Journal of Artificial Intelligence Research, 2009.](https://mlanthology.org/jair/2009/yates2009jair-unsupervised/) doi:10.1613/JAIR.2772

BibTeX

@article{yates2009jair-unsupervised,
  title     = {{Unsupervised Methods for Determining Object and Relation Synonyms on the Web}},
  author    = {Yates, Alexander and Etzioni, Oren},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2009},
  pages     = {255-296},
  doi       = {10.1613/JAIR.2772},
  volume    = {34},
  url       = {https://mlanthology.org/jair/2009/yates2009jair-unsupervised/}
}