Category-Guided Visual Question Generation (Student Abstract)

Abstract

Visual question generation aims to generate high-quality questions related to images. Generating questions based only on images can better reduce labor costs and thus be easily applied. However, their methods tend to generate similar general questions that fail to ask questions about the specific content of each image scene. In this paper, we propose a category-guided visual question generation model that can generate questions with multiple categories that focus on different objects in an image. Specifically, our model first selects the appropriate question category based on the objects in the image and the relationships among objects. Then, we generate corresponding questions based on the selected question categories. Experiments conducted on the TDIUC dataset show that our proposed model outperforms existing models in terms of diversity and quality.

Cite

Text

Liu et al. "Category-Guided Visual Question Generation (Student Abstract)." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I13.26991

Markdown

[Liu et al. "Category-Guided Visual Question Generation (Student Abstract)." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/liu2023aaai-category/) doi:10.1609/AAAI.V37I13.26991

BibTeX

@inproceedings{liu2023aaai-category,
  title     = {{Category-Guided Visual Question Generation (Student Abstract)}},
  author    = {Liu, Hongfei and Chen, Jiali and Fang, Wenhao and Xie, Jiayuan and Cai, Yi},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {16262-16263},
  doi       = {10.1609/AAAI.V37I13.26991},
  url       = {https://mlanthology.org/aaai/2023/liu2023aaai-category/}
}