Word Embedding Based Correlation Model for Question/Answer Matching

Abstract

The large scale of Q&A archives accumulated in community based question answering (CQA) servivces are important information and knowledge resource on the web. Question and answer matching task has been attached much importance to for its ability to reuse knowledge stored in these systems: it can be useful in enhancing user experience with recurrent questions. In this paper, a Word Embedding based Correlation (WEC) model is proposed by integrating advantages of both the translation model and word embedding. Given a random pair of words, WEC can score their co-occurrence probability in Q&A pairs, while it can also leverage the continuity and smoothness of continuous space word representation to deal with new pairs of words that are rare in the training parallel text. An experimental study on Yahoo! Answers dataset and Baidu Zhidao dataset shows this new method's promising potential.

Cite

Text

Shen et al. "Word Embedding Based Correlation Model for Question/Answer Matching." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.11002

Markdown

[Shen et al. "Word Embedding Based Correlation Model for Question/Answer Matching." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/shen2017aaai-word/) doi:10.1609/AAAI.V31I1.11002

BibTeX

@inproceedings{shen2017aaai-word,
  title     = {{Word Embedding Based Correlation Model for Question/Answer Matching}},
  author    = {Shen, Yikang and Rong, Wenge and Jiang, Nan and Peng, Baolin and Tang, Jie and Xiong, Zhang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {3511-3517},
  doi       = {10.1609/AAAI.V31I1.11002},
  url       = {https://mlanthology.org/aaai/2017/shen2017aaai-word/}
}