Grammar as a Foreign Language
Abstract
Syntactic constituency parsing is a fundamental problem in naturallanguage processing which has been the subject of intensive researchand engineering for decades. As a result, the most accurate parsersare domain specific, complex, and inefficient. In this paper we showthat the domain agnostic attention-enhanced sequence-to-sequence modelachieves state-of-the-art results on the most widely used syntacticconstituency parsing dataset, when trained on a large synthetic corpusthat was annotated using existing parsers. It also matches theperformance of standard parsers when trained on a smallhuman-annotated dataset, which shows that this model is highlydata-efficient, in contrast to sequence-to-sequence models without theattention mechanism. Our parser is also fast, processing over ahundred sentences per second with an unoptimized CPU implementation.
Cite
Text
Vinyals et al. "Grammar as a Foreign Language." Neural Information Processing Systems, 2015.Markdown
[Vinyals et al. "Grammar as a Foreign Language." Neural Information Processing Systems, 2015.](https://mlanthology.org/neurips/2015/vinyals2015neurips-grammar/)BibTeX
@inproceedings{vinyals2015neurips-grammar,
title = {{Grammar as a Foreign Language}},
author = {Vinyals, Oriol and Kaiser, Łukasz and Koo, Terry and Petrov, Slav and Sutskever, Ilya and Hinton, Geoffrey},
booktitle = {Neural Information Processing Systems},
year = {2015},
pages = {2773-2781},
url = {https://mlanthology.org/neurips/2015/vinyals2015neurips-grammar/}
}