NAPS: Natural Program Synthesis Dataset
Abstract
We present a program synthesis-oriented dataset consisting of human written problem statements and solutions for these problems. The problem statements were collected via crowdsourcing and the program solutions were extracted from human-written solutions in programming competitions, accompanied by input/output examples. We propose using this dataset for the program synthesis tasks aimed at working with real user-generated data. As a baseline, we present few models, with the best model achieving 5.6% accuracy, showcasing both complexity of the dataset and large room for future research.
Cite
Text
Zavershynskyi et al. "NAPS: Natural Program Synthesis Dataset." ICML 2018 Workshops: NAMPI, 2018.Markdown
[Zavershynskyi et al. "NAPS: Natural Program Synthesis Dataset." ICML 2018 Workshops: NAMPI, 2018.](https://mlanthology.org/icmlw/2018/zavershynskyi2018icmlw-naps/)BibTeX
@inproceedings{zavershynskyi2018icmlw-naps,
title = {{NAPS: Natural Program Synthesis Dataset}},
author = {Zavershynskyi, Maksym and Skidanov, Alex and Polosukhin, Illia},
booktitle = {ICML 2018 Workshops: NAMPI},
year = {2018},
url = {https://mlanthology.org/icmlw/2018/zavershynskyi2018icmlw-naps/}
}