Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs

Abstract

We describe an approach to speed-up inference with latent variable PCFGs, which have been shown to be highly effective for natural language parsing. Our approach is based on a tensor formulation recently introduced for spectral estimation of latent-variable PCFGs coupled with a tensor decomposition algorithm well-known in the multilinear algebra literature. We also describe an error bound for this approximation, which bounds the difference between the probabilities calculated by the algorithm and the true probabilities that the approximated model gives. Empirical evaluation on real-world natural language parsing data demonstrates a significant speed-up at minimal cost for parsing performance.

Cite

Text

Collins and Cohen. "Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs." Neural Information Processing Systems, 2012.

Markdown

[Collins and Cohen. "Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs." Neural Information Processing Systems, 2012.](https://mlanthology.org/neurips/2012/collins2012neurips-tensor/)

BibTeX

@inproceedings{collins2012neurips-tensor,
  title     = {{Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs}},
  author    = {Collins, Michael and Cohen, Shay B.},
  booktitle = {Neural Information Processing Systems},
  year      = {2012},
  pages     = {2519-2527},
  url       = {https://mlanthology.org/neurips/2012/collins2012neurips-tensor/}
}