Huang, Zhiqi, Fenglin Liu, Xian Wu, Shen Ge, Helin Wang, Wei Fan, and Yuexian Zou. “Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-Modality Attention”. Proceedings of the AAAI Conference on Artificial Intelligence 35, no. 14 (May 18, 2021): 13098-13106. Accessed April 24, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/17548.