Ye, Q., W. Zeng, M. Liu, J. Zhang, Y. Hu, Z. Yu, and Y. Zhou. “When Eyes and Ears Disagree: Can MLLMs Discern Audio-Visual Confusion?” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 14, Mar. 2026, pp. 11955-63, doi:10.1609/aaai.v40i14.38183.