Zong, D., & Sun, S. (2023). RETRACTED: McOmet: Multimodal Fusion Transformer for Physical Audiovisual Commonsense Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 37(5), 6621-6629. https://doi.org/10.1609/aaai.v37i5.25813