Gu, J., J. Cai, G. Wang, and T. Chen. “Stack-Captioning: Coarse-to-Fine Learning for Image Captioning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 32, no. 1, Apr. 2018, doi:10.1609/aaai.v32i1.12266.