Mo, Wentao, and Yang Liu. “Bridging the Gap Between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 5 (March 24, 2024): 4261-4268. Accessed August 6, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/28222.