[1] S. Yang, S. Luo, and S. C. Han, “Multimodal Commonsense Knowledge Distillation for Visual Question Answering (Student Abstract),” AAAI, vol. 39, no. 28, pp. 29545–29547, Apr. 2025.