Xie, L., Kuthiala, A., Wei, G. Z., Zheng, C., Bal, A., Dabhi, M., … Jeni, L. A. (2026). MAVERIX: Multimodal Audio-Visual Evaluation and Recognition IndeX. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27090–27098. https://doi.org/10.1609/aaai.v40i32.39923