Lu, Pan, Hongsheng Li, Wei Zhang, Jianyong Wang, and Xiaogang Wang. “Co-Attending Free-Form Regions and Detections With Multi-Modal Multiplicative Feature Embedding for Visual Question Answering”. Proceedings of the AAAI Conference on Artificial Intelligence 32, no. 1 (April 27, 2018). Accessed April 19, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/12240.