Mo, W., & Liu, Y. (2024). Bridging the Gap between 2D and 3D Visual Question Answering: A Fusion Approach for 3D VQA. Proceedings of the AAAI Conference on Artificial Intelligence, 38(5), 4261–4268. https://doi.org/10.1609/aaai.v38i5.28222