Tree Sequence Kernel for Natural Language

Abstract

We propose Tree Sequence Kernel (TSK), which implicitly exhausts the structure features of a sequence of subtrees embedded in the phrasal parse tree. By incorporating the capability of sequence kernel, TSK enriches tree kernel with tree sequence features so that it may provide additional useful patterns for machine learning applications. Two approaches of penalizing the substructures are proposed and both can be accomplished by efficient algorithms via dynamic programming. Evaluations are performed on two natural language tasks, i.e. Question Classification and Relation Extraction. Experimental results suggest that TSK outperforms tree kernel for both tasks, which also reveals that the structure features made up of multiple subtrees are effective and play a complementary role to the single tree structure.

Cite

Text

Sun et al. "Tree Sequence Kernel for Natural Language." AAAI Conference on Artificial Intelligence, 2011. doi:10.1609/AAAI.V25I1.7973

Markdown

[Sun et al. "Tree Sequence Kernel for Natural Language." AAAI Conference on Artificial Intelligence, 2011.](https://mlanthology.org/aaai/2011/sun2011aaai-tree/) doi:10.1609/AAAI.V25I1.7973

BibTeX

@inproceedings{sun2011aaai-tree,
  title     = {{Tree Sequence Kernel for Natural Language}},
  author    = {Sun, Jun and Zhang, Min and Tan, Chew Lim},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2011},
  pages     = {921-926},
  doi       = {10.1609/AAAI.V25I1.7973},
  url       = {https://mlanthology.org/aaai/2011/sun2011aaai-tree/}
}