NAPS: Natural Program Synthesis Dataset

Abstract

We present a program synthesis-oriented dataset consisting of human written problem statements and solutions for these problems. The problem statements were collected via crowdsourcing and the program solutions were extracted from human-written solutions in programming competitions, accompanied by input/output examples. We propose using this dataset for the program synthesis tasks aimed at working with real user-generated data. As a baseline, we present few models, with the best model achieving 5.6% accuracy, showcasing both complexity of the dataset and large room for future research.

Cite

Text

Zavershynskyi et al. "NAPS: Natural Program Synthesis Dataset." ICML 2018 Workshops: NAMPI, 2018.

Markdown

[Zavershynskyi et al. "NAPS: Natural Program Synthesis Dataset." ICML 2018 Workshops: NAMPI, 2018.](https://mlanthology.org/icmlw/2018/zavershynskyi2018icmlw-naps/)

BibTeX

@inproceedings{zavershynskyi2018icmlw-naps,
  title     = {{NAPS: Natural Program Synthesis Dataset}},
  author    = {Zavershynskyi, Maksym and Skidanov, Alex and Polosukhin, Illia},
  booktitle = {ICML 2018 Workshops: NAMPI},
  year      = {2018},
  url       = {https://mlanthology.org/icmlw/2018/zavershynskyi2018icmlw-naps/}
}