Category-Guided Visual Question Generation (Student Abstract)

Authors

  • Hongfei Liu School of Software Engineering, South China University of Technology, Guangzhou, China Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education
  • Jiali Chen School of Software Engineering, South China University of Technology, Guangzhou, China Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education
  • Wenhao Fang School of Software Engineering, South China University of Technology, Guangzhou, China Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education
  • Jiayuan Xie School of Software Engineering, South China University of Technology, Guangzhou, China Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education
  • Yi Cai School of Software Engineering, South China University of Technology, Guangzhou, China Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education

DOI:

https://doi.org/10.1609/aaai.v37i13.26991

Keywords:

Visual Question Generation, Diversity Generation, Multimodal

Abstract

Visual question generation aims to generate high-quality questions related to images. Generating questions based only on images can better reduce labor costs and thus be easily applied. However, their methods tend to generate similar general questions that fail to ask questions about the specific content of each image scene. In this paper, we propose a category-guided visual question generation model that can generate questions with multiple categories that focus on different objects in an image. Specifically, our model first selects the appropriate question category based on the objects in the image and the relationships among objects. Then, we generate corresponding questions based on the selected question categories. Experiments conducted on the TDIUC dataset show that our proposed model outperforms existing models in terms of diversity and quality.

Downloads

Published

2023-09-06

How to Cite

Liu, H., Chen, J., Fang, W., Xie, J., & Cai, Y. (2023). Category-Guided Visual Question Generation (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 37(13), 16262-16263. https://doi.org/10.1609/aaai.v37i13.26991