Grammar as a Foreign Language

Abstract

Syntactic constituency parsing is a fundamental problem in naturallanguage processing which has been the subject of intensive researchand engineering for decades. As a result, the most accurate parsersare domain specific, complex, and inefficient. In this paper we showthat the domain agnostic attention-enhanced sequence-to-sequence modelachieves state-of-the-art results on the most widely used syntacticconstituency parsing dataset, when trained on a large synthetic corpusthat was annotated using existing parsers. It also matches theperformance of standard parsers when trained on a smallhuman-annotated dataset, which shows that this model is highlydata-efficient, in contrast to sequence-to-sequence models without theattention mechanism. Our parser is also fast, processing over ahundred sentences per second with an unoptimized CPU implementation.

Cite

Text

Vinyals et al. "Grammar as a Foreign Language." Neural Information Processing Systems, 2015.

Markdown

[Vinyals et al. "Grammar as a Foreign Language." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/vinyals2015neurips-grammar/)

BibTeX

@inproceedings{vinyals2015neurips-grammar,
  title     = {{Grammar as a Foreign Language}},
  author    = {Vinyals, Oriol and Kaiser, Łukasz and Koo, Terry and Petrov, Slav and Sutskever, Ilya and Hinton, Geoffrey},
  booktitle = {Neural Information Processing Systems},
  year      = {2015},
  pages     = {2773-2781},
  url       = {https://mlanthology.org/neurips/2015/vinyals2015neurips-grammar/}
}