[1]
D. Zong and S. Sun, “RETRACTED: McOmet: Multimodal Fusion Transformer for Physical Audiovisual Commonsense Reasoning”, AAAI, vol. 37, no. 5, pp. 6621-6629, Jun. 2023.