MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations
Abstract
We study conversational dialog in which there are many possible responses to a given history. We present the MultiTalk Dataset, a corpus of over 320,000 sentences of written conversational dialog that balances a high branching factor (10) with several conversation turns (6) through selective branch continuation. We make multiple contributions to study dialog generation in the highly branching setting. In order to evaluate a diverse set of generations, we propose a simple scoring algorithm, based on bipartite graph matching, to optimally incorporate a set of diverse references. We study multiple language generation tasks at different levels of predictive conversation depth, using textual attributes induced automatically from pretrained classifiers. Our culminating task is a challenging theory of mind problem, a controllable generation task which requires reasoning about the expected reaction of the listener.
Cite
Text
Dou et al. "MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I14.17510Markdown
[Dou et al. "MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/dou2021aaai-multitalk/) doi:10.1609/AAAI.V35I14.17510BibTeX
@inproceedings{dou2021aaai-multitalk,
title = {{MultiTalk: A Highly-Branching Dialog Testbed for Diverse Conversations}},
author = {Dou, Yao and Forbes, Maxwell and Holtzman, Ari and Choi, Yejin},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2021},
pages = {12760-12767},
doi = {10.1609/AAAI.V35I14.17510},
url = {https://mlanthology.org/aaai/2021/dou2021aaai-multitalk/}
}