[1]

Zhang, W., Shi, H., Tang, S., Xiao, J., Yu, Q. and Zhuang, Y. 2021. Consensus Graph Representation Learning for Better Grounded Image Captioning. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 4 (May 2021), 3394-3402. DOI:https://doi.org/10.1609/aaai.v35i4.16452.