Xie, L. (2026) “MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), pp. 27090–27098. doi: 10.1609/aaai.v40i32.39923.