Using Decision Trees to Construct a Practical Parser

Abstract

This paper describes a novel and practical Japanese parser that uses decision trees. First, we construct a single decision tree to estimate modification probabilities; how one phrase tends to modify another. Next, we introduce a boosting algorithm in which several decision trees are constructed and then combined for probability estimation. The constructed parsers are evaluated using the EDR Japanese annotated corpus. The single-tree method significantly outperforms the conventional Japanese stochastic methods. Moreover, the boosted version of the parser is shown to have great advantages; (1) a better parsing accuracy than its single-tree counterpart for any amount of training data and (2) no over-fitting to data for various iterations. The presented parser, the first non-English stochastic parser with practical performance, should tighten the coupling between natural language processing and machine learning.

Cite

Text

Haruno et al. "Using Decision Trees to Construct a Practical Parser." Machine Learning, 1999. doi:10.1023/A:1007597902467

Markdown

[Haruno et al. "Using Decision Trees to Construct a Practical Parser." Machine Learning, 1999.](https://mlanthology.org/mlj/1999/haruno1999mlj-using/) doi:10.1023/A:1007597902467

BibTeX

@article{haruno1999mlj-using,
  title     = {{Using Decision Trees to Construct a Practical Parser}},
  author    = {Haruno, Masahiko and Shirai, Satoshi and Ooyama, Yoshifumi},
  journal   = {Machine Learning},
  year      = {1999},
  pages     = {131-149},
  doi       = {10.1023/A:1007597902467},
  volume    = {34},
  url       = {https://mlanthology.org/mlj/1999/haruno1999mlj-using/}
}