Ben-younes, Hedi, Remi Cadene, Nicolas Thome, and Matthieu Cord. “BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection”. Proceedings of the AAAI Conference on Artificial Intelligence 33, no. 01 (July 17, 2019): 8102–8109. Accessed May 30, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/4818.