Paraphrase Diversification Using Counterfactual Debiasing
Abstract
The problem of generating a set of diverse paraphrase sentences while (1) not compromising the original meaning of the original sentence, and (2) imposing diversity in various semantic aspects, such as a lexical or syntactic structure, is examined. Existing work on paraphrase generation has focused more on the former, and the latter was trained as a fixed style transfer, such as transferring from positive to negative sentiments, even at the cost of losing semantics. In this work, we consider style transfer as a means of imposing diversity, with a paraphrasing correctness constraint that the target sentence must remain a paraphrase of the original sentence. However, our goal is to maximize the diversity for a set of k generated paraphrases, denoted as the diversified paraphrase (DP) problem. Our key contribution is deciding the style guidance at generation towards the direction of increasing the diversity of output with respect to those generated previously. As pre-materializing training data for all style decisions is impractical, we train with biased data, but with debiasing guidance. Compared to state-of-the-art methods, our proposed model can generate more diverse and yet semantically consistent paraphrase sentences. That is, our model, trained with the MSCOCO dataset, achieves the highest embedding scores, .94/.95/.86, similar to state-of-the-art results, but with a lower mBLEU score (more diverse) by 8.73%.
Cite
Text
Park et al. "Paraphrase Diversification Using Counterfactual Debiasing." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33016883Markdown
[Park et al. "Paraphrase Diversification Using Counterfactual Debiasing." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/park2019aaai-paraphrase/) doi:10.1609/AAAI.V33I01.33016883BibTeX
@inproceedings{park2019aaai-paraphrase,
title = {{Paraphrase Diversification Using Counterfactual Debiasing}},
author = {Park, Sunghyun and Hwang, Seung-won and Chen, Fuxiang and Choo, Jaegul and Ha, Jung-Woo and Kim, Sunghun and Yim, Jinyeong},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2019},
pages = {6883-6891},
doi = {10.1609/AAAI.V33I01.33016883},
url = {https://mlanthology.org/aaai/2019/park2019aaai-paraphrase/}
}