Guo, Z., Liang, C., Wan, Z., & Bai, Y. (2021). Global Fusion Attention for Vision and Language Understanding (Student Abstract). Proceedings of the AAAI Conference on Artificial Intelligence, 35(18), 15789-15790. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17891