[1] Z. Fei, “Attention-Aligned Transformer for Image Captioning”, AAAI, vol. 36, no. 1, pp. 607-615, Jun. 2022.