ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies

Abstract

We introduce ExTaSem!, a novel approach for the automatic learning of lexical taxonomies from domain terminologies. First, we exploit a very large semantic network to collect housands of in-domain textual definitions. Second, we extract (hyponym, hypernym) pairs from each definition with a CRF-based algorithm trained on manually-validated data. Finally, we introduce a graph induction procedure which constructs a full-fledged taxonomy where each edge is weighted according to its domain pertinence. ExTaSem! achieves state-of-the-art results in the following taxonomy evaluation experiments: (1) Hypernym discovery, (2) Reconstructing gold standard taxonomies, and (3) Taxonomy quality according to structural measures. We release weighted taxonomies for six domains for the use and scrutiny of the community.

Cite

Text

Anke et al. "ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10330

Markdown

[Anke et al. "ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/anke2016aaai-extasem/) doi:10.1609/AAAI.V30I1.10330

BibTeX

@inproceedings{anke2016aaai-extasem,
  title     = {{ExTaSem! Extending, Taxonomizing and Semantifying Domain Terminologies}},
  author    = {Anke, Luis Espinosa and Saggion, Horacio and Ronzano, Francesco and Navigli, Roberto},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2016},
  pages     = {2594-2600},
  doi       = {10.1609/AAAI.V30I1.10330},
  url       = {https://mlanthology.org/aaai/2016/anke2016aaai-extasem/}
}