End-to-End Transition-Based Online Dialogue Disentanglement
Abstract
Dialogue disentanglement aims to separate intermingled messages into detached sessions. The existing research focuses on two-step architectures, in which a model first retrieves the relationships between two messages and then divides the message stream into separate clusters. Almost all existing work puts significant efforts on selecting features for message-pair classification and clustering, while ignoring the semantic coherence within each session. In this paper, we introduce the first end-to- end transition-based model for online dialogue disentanglement. Our model captures the sequential information of each session as the online algorithm proceeds on processing a dialogue. The coherence in a session is hence modeled when messages are sequentially added into their best-matching sessions. Meanwhile, the research field still lacks data for studying end-to-end dialogue disentanglement, so we construct a large-scale dataset by extracting coherent dialogues from online movie scripts. We evaluate our model on both the dataset we developed and the publicly available Ubuntu IRC dataset [Kummerfeld et al., 2019]. The results show that our model significantly outperforms the existing algorithms. Further experiments demonstrate that our model better captures the sequential semantics and obtains more coherent disentangled sessions.
Cite
Text
Liu et al. "End-to-End Transition-Based Online Dialogue Disentanglement." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/535Markdown
[Liu et al. "End-to-End Transition-Based Online Dialogue Disentanglement." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/liu2020ijcai-end/) doi:10.24963/IJCAI.2020/535BibTeX
@inproceedings{liu2020ijcai-end,
title = {{End-to-End Transition-Based Online Dialogue Disentanglement}},
author = {Liu, Hui and Shi, Zhan and Gu, Jia-Chen and Liu, Quan and Wei, Si and Zhu, Xiaodan},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2020},
pages = {3868-3874},
doi = {10.24963/IJCAI.2020/535},
url = {https://mlanthology.org/ijcai/2020/liu2020ijcai-end/}
}