Combining Error-Driven Pruning and Classification for Partial Parsing

Abstract

We present a new approach to partial parsing of natural language texts that relies on machine learning methods. The approach combines corpus-based grammar induction with a very simple pattern-matching algorithm and an optional constituent verification step. The grammar induction algorithm acquires a set of rules for each level of linguistic analysis using a new technique for error-driven pruning of treebank grammars. The constituent verification step employs standard inductive learning techniques as an additional precision-enhancing device. We evaluate the approach on four partial parsing data sets and find that performance is very good (over 93% precision and recall) for applications that require or prefer fairly simple constituent bracketing. As the complexity of the partial parsing task increases, however, our approach lags the performance of competing approaches. We explain these differences in terms of the knowledge sources employed by each method and describe a number of features ...

Cite

Text

Cardie et al. "Combining Error-Driven Pruning and Classification for Partial Parsing." International Conference on Machine Learning, 1999.

Markdown

[Cardie et al. "Combining Error-Driven Pruning and Classification for Partial Parsing." International Conference on Machine Learning, 1999.](https://mlanthology.org/icml/1999/cardie1999icml-combining/)

BibTeX

@inproceedings{cardie1999icml-combining,
  title     = {{Combining Error-Driven Pruning and Classification for Partial Parsing}},
  author    = {Cardie, Claire and Mardis, Scott Anthony and Pierce, David R.},
  booktitle = {International Conference on Machine Learning},
  year      = {1999},
  pages     = {87-96},
  url       = {https://mlanthology.org/icml/1999/cardie1999icml-combining/}
}