Response Enhanced Semi-Supervised Dialogue Query Generation

Abstract

Leveraging vast and continually updated knowledge from the Internet has been considered an important ability for a dialogue system. Therefore, the dialogue query generation task is proposed for generating search queries from dialogue histories, which will be submitted to a search engine for retrieving relevant websites on the Internet. In this regard, previous efforts were devoted to collecting conversations with annotated queries and training a query producer (QP) via standard supervised learning. However, these studies still face the challenges of data scarcity and domain adaptation. To address these issues, in this paper, we propose a semi-supervised learning framework -- SemiDQG, to improve model performance with unlabeled conversations. Based on the observation that the search query is typically related to the topic of dialogue response, we train a response-augmented query producer (RA) to provide rich and effective training signals for QP. We first apply a similarity-based query selection strategy to select high-quality RA-generated pseudo queries, which are used to construct pseudo instances for training QP and RA. Then, we adopt the REINFORCE algorithm to further enhance QP, with RA-provided rewards as fine-grained training signals. Experimental results and in-depth analysis of three benchmarks show the effectiveness of our framework in cross-domain and low-resource scenarios. Particularly, SemiDQG significantly surpasses ChatGPT and competitive baselines. Our code is available at \url{https://github.com/DeepLearnXMU/SemiDQG}.

Cite

Text

Huang et al. "Response Enhanced Semi-Supervised Dialogue Query Generation." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I16.29790

Markdown

[Huang et al. "Response Enhanced Semi-Supervised Dialogue Query Generation." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/huang2024aaai-response/) doi:10.1609/AAAI.V38I16.29790

BibTeX

@inproceedings{huang2024aaai-response,
  title     = {{Response Enhanced Semi-Supervised Dialogue Query Generation}},
  author    = {Huang, Jianheng and Wang, Ante and Gao, Linfeng and Song, Linfeng and Su, Jinsong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {18307-18315},
  doi       = {10.1609/AAAI.V38I16.29790},
  url       = {https://mlanthology.org/aaai/2024/huang2024aaai-response/}
}