End-to-End Optimization of Goal-Driven and Visually Grounded Dialogue Systems
Abstract
End-to-end design of dialogue systems has recently become a popular research topic thanks to powerful tools such as encoder-decoder architectures for sequence-to-sequence learning. Yet, most current approaches cast human-machine dialogue management as a supervised learning problem, aiming at predicting the next utterance of a participant given the full history of the dialogue. This vision is too simplistic to render the intrinsic planning problem inherent to dialogue as well as its grounded nature , making the context of a dialogue larger than the sole history. This is why only chitchat and question answering tasks have been addressed so far using end-to-end architectures. In this paper, we introduce a Deep Reinforcement Learning method to optimize visually grounded task-oriented dialogues , based on the policy gradient algorithm. This approach is tested on a dataset of 120k dialogues collected through Mechanical Turk and provides encouraging results at solving both the problem of generating natural dialogues and the task of discovering a specific object in a complex picture.
Cite
Text
Strub et al. "End-to-End Optimization of Goal-Driven and Visually Grounded Dialogue Systems." International Joint Conference on Artificial Intelligence, 2017. doi:10.24963/IJCAI.2017/385Markdown
[Strub et al. "End-to-End Optimization of Goal-Driven and Visually Grounded Dialogue Systems." International Joint Conference on Artificial Intelligence, 2017.](https://mlanthology.org/ijcai/2017/strub2017ijcai-end/) doi:10.24963/IJCAI.2017/385BibTeX
@inproceedings{strub2017ijcai-end,
title = {{End-to-End Optimization of Goal-Driven and Visually Grounded Dialogue Systems}},
author = {Strub, Florian and de Vries, Harm and Mary, Jérémie and Piot, Bilal and Courville, Aaron C. and Pietquin, Olivier},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2017},
pages = {2765-2771},
doi = {10.24963/IJCAI.2017/385},
url = {https://mlanthology.org/ijcai/2017/strub2017ijcai-end/}
}