[1]
Y. Han, Y. Hu, X. Song, H. Tang, M. Xu, and L. Nie, “Exploiting the Social-Like Prior in Transformer for Visual Reasoning”, AAAI, vol. 38, no. 3, pp. 2058–2066, Mar. 2024.