Lao, Mingrui, Nan Pu, Yu Liu, Kai He, Erwin M. Bakker, and Michael S. Lew. “COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering.” Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 11 (June 26, 2023): 12995–13003. Accessed July 11, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/26527.