Gupta, A., Y. Verma, and C. Jawahar. “Choosing Linguistics over Vision to Describe Images”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 26, no. 1, Sept. 2021, pp. 606-12, doi:10.1609/aaai.v26i1.8205.