[1]
J. Ji, “Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network”, AAAI, vol. 35, no. 2, pp. 1655-1663, May 2021.