Contrastive Latent Variable Models for Neural Text Generation
Abstract
Deep latent variable models such as variational autoencoders and energy-based models are widely used for neural text generation. Most of them focus on matching the prior distribution with the posterior distribution of the latent variable for text reconstruction. In addition to instance-level reconstruction, this paper aims to integrate contrastive learning into the latent space, forcing the latent variables to learn high-level semantics by exploring inter-instance relationships. Experiments on various text generation benchmarks show the effectiveness of our proposed method. We also empirically show that our method can mitigate the posterior collapse issue for latent-variable-based text generation models.
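The abstract describes the approach only at a high level; the full objective is given in the paper. As a rough, hypothetical sketch of the general mechanism (not the authors' exact formulation), an InfoNCE-style contrastive term over latent vectors could be combined with the usual reconstruction objective as follows. The function name, the two-view setup, and the loss weighting below are all assumptions for illustration:

import torch
import torch.nn.functional as F

def info_nce_latent_loss(z, z_pos, temperature=0.1):
    # z, z_pos: [batch, dim] latent codes for two views of the same
    # instances; other items in the batch serve as in-batch negatives.
    z = F.normalize(z, dim=-1)
    z_pos = F.normalize(z_pos, dim=-1)
    logits = z @ z_pos.t() / temperature               # pairwise cosine similarities
    labels = torch.arange(z.size(0), device=z.device)  # positive pairs lie on the diagonal
    return F.cross_entropy(logits, labels)

# A combined objective would add this term to the ELBO, e.g. (weights hypothetical):
# loss = reconstruction_loss + kl_weight * kl_loss + cl_weight * info_nce_latent_loss(z1, z2)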
Cite
Text
Teng et al. "Contrastive Latent Variable Models for Neural Text Generation." Uncertainty in Artificial Intelligence, 2022.
Markdown
[Teng et al. "Contrastive Latent Variable Models for Neural Text Generation." Uncertainty in Artificial Intelligence, 2022.](https://mlanthology.org/uai/2022/teng2022uai-contrastive/)
BibTeX
@inproceedings{teng2022uai-contrastive,
title = {{Contrastive Latent Variable Models for Neural Text Generation}},
author = {Teng, Zhiyang and Chen, Chenhua and Zhang, Yan and Zhang, Yue},
booktitle = {Uncertainty in Artificial Intelligence},
year = {2022},
pages = {1928--1938},
volume = {180},
url = {https://mlanthology.org/uai/2022/teng2022uai-contrastive/}
}