Interactive Dual Generative Adversarial Networks for Image Captioning
Abstract
Image captioning is usually built on either generation-based or retrieval-based approaches. Both ways have certain strengths but suffer from their own limitations. In this paper, we propose an Interactive Dual Generative Adversarial Network (IDGAN) for image captioning, which mutually combines the retrieval-based and generation-based methods to learn a better image captioning ensemble. IDGAN consists of two generators and two discriminators, where the generation- and retrieval-based generators mutually benefit from each other's complementary targets that are learned from two dual adversarial discriminators. Specifically, the generation- and retrieval-based generators provide improved synthetic and retrieved candidate captions with informative feedback signals from the two respective discriminators that are trained to distinguish the generated captions from the true captions and assign top rankings to true captions respectively, thus featuring the merits of both retrieval-based and generation-based approaches. Extensive experiments on MSCOCO dataset demonstrate that the proposed IDGAN model significantly outperforms the compared methods for image captioning.
Cite
Text
Liu et al. "Interactive Dual Generative Adversarial Networks for Image Captioning." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I07.6826Markdown
[Liu et al. "Interactive Dual Generative Adversarial Networks for Image Captioning." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/liu2020aaai-interactive-a/) doi:10.1609/AAAI.V34I07.6826BibTeX
@inproceedings{liu2020aaai-interactive-a,
title = {{Interactive Dual Generative Adversarial Networks for Image Captioning}},
author = {Liu, Junhao and Wang, Kai and Xu, Chunpu and Zhao, Zhou and Xu, Ruifeng and Shen, Ying and Yang, Min},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2020},
pages = {11588-11595},
doi = {10.1609/AAAI.V34I07.6826},
url = {https://mlanthology.org/aaai/2020/liu2020aaai-interactive-a/}
}