Bimodal Modelling of Source Code and Natural Language
Abstract
We consider the problem of building probabilistic models that jointly model short natural language utterances and source code snippets. The aim is to bring together recent work on statistical modelling of source code and work on bimodal models of images and natural language. The resulting models are useful for a variety of tasks that involve natural language and source code. We demonstrate their performance on two retrieval tasks: retrieving source code snippets given a natural language query, and retrieving natural language descriptions given a source code query (i.e., source code captioning). The experiments show there to be promise in this direction, and that modelling the structure of source code is helpful towards the retrieval tasks.
Cite
Text
Allamanis et al. "Bimodal Modelling of Source Code and Natural Language." International Conference on Machine Learning, 2015.Markdown
[Allamanis et al. "Bimodal Modelling of Source Code and Natural Language." International Conference on Machine Learning, 2015.](https://mlanthology.org/icml/2015/allamanis2015icml-bimodal/)BibTeX
@inproceedings{allamanis2015icml-bimodal,
title = {{Bimodal Modelling of Source Code and Natural Language}},
author = {Allamanis, Miltos and Tarlow, Daniel and Gordon, Andrew and Wei, Yi},
booktitle = {International Conference on Machine Learning},
year = {2015},
pages = {2123-2132},
volume = {37},
url = {https://mlanthology.org/icml/2015/allamanis2015icml-bimodal/}
}