Category-Guided Visual Question Generation (Student Abstract)
DOI:
https://doi.org/10.1609/aaai.v37i13.26991Keywords:
Visual Question Generation, Diversity Generation, MultimodalAbstract
Visual question generation aims to generate high-quality questions related to images. Generating questions based only on images can better reduce labor costs and thus be easily applied. However, their methods tend to generate similar general questions that fail to ask questions about the specific content of each image scene. In this paper, we propose a category-guided visual question generation model that can generate questions with multiple categories that focus on different objects in an image. Specifically, our model first selects the appropriate question category based on the objects in the image and the relationships among objects. Then, we generate corresponding questions based on the selected question categories. Experiments conducted on the TDIUC dataset show that our proposed model outperforms existing models in terms of diversity and quality.Downloads
Published
2024-07-15
How to Cite
Liu, H., Chen, J., Fang, W., Xie, J., & Cai, Y. (2024). Category-Guided Visual Question Generation (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16262-16263. https://doi.org/10.1609/aaai.v37i13.26991
Issue
Section
AAAI Student Abstract and Poster Program