Song, S., Park, M., & Kim, G. (2026). MAVIS: A Benchmark for Multimodal Source Attribution in Long-form Visual Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 40(39), 33028–33037. https://doi.org/10.1609/aaai.v40i39.40585