Improving Term Extraction by System Combination Using Boosting

Vivaldi, Jordi; Màrquez, Lluís; Rodríguez, Horacio

doi:10.1007/3-540-44795-4_44

Improving Term Extraction by System Combination Using Boosting

Jordi Vivaldi, Lluís Màrquez, Horacio Rodríguez

ECML-PKDD 2001 pp. 515-526

doi:10.1007/3-540-44795-4_44 /ecmlpkdd/2001/vivaldi2001ecml-improving/

Abstract

Term extraction is the task of automatically detecting, from textual corpora, lexical units that designate concepts in thematically restricted domains (e.g. medicine). Current systems for term extraction integrate linguistic and statistical cues to perform the detection of terms. The best results have been obtained when some kind of combination of simple base term extractors is performed [14]. In this paper it is shown that this combination can be further improved by posing an additional learning problem of how to find the best combination of base term extractors. Empirical results, using AdaBoost in the metalearning step, show that the ensemble constructed surpasses the performance of all individual extractors and simple voting schemes, obtaining significantly better accuracy figures at all levels of recall.

PDF ECML-PKDD Semantic Scholar

Cite

Text

Vivaldi et al. "Improving Term Extraction by System Combination Using Boosting." European Conference on Machine Learning, 2001. doi:10.1007/3-540-44795-4_44

Markdown

[Vivaldi et al. "Improving Term Extraction by System Combination Using Boosting." European Conference on Machine Learning, 2001.](https://mlanthology.org/ecmlpkdd/2001/vivaldi2001ecml-improving/) doi:10.1007/3-540-44795-4_44

BibTeX

@inproceedings{vivaldi2001ecml-improving,
  title     = {{Improving Term Extraction by System Combination Using Boosting}},
  author    = {Vivaldi, Jordi and Màrquez, Lluís and Rodríguez, Horacio},
  booktitle = {European Conference on Machine Learning},
  year      = {2001},
  pages     = {515-526},
  doi       = {10.1007/3-540-44795-4_44},
  url       = {https://mlanthology.org/ecmlpkdd/2001/vivaldi2001ecml-improving/}
}