Wang, Y., Xu, J. and Sun, Y. (2022) “End-to-End Transformer Based Model for Image Captioning”, Proceedings of the AAAI Conference on Artificial Intelligence, 36(3), pp. 2585-2594. doi: 10.1609/aaai.v36i3.20160.