[1]
Zhang, W., Shi, H., Tang, S., Xiao, J., Yu, Q. and Zhuang, Y. 2021. Consensus Graph Representation Learning for Better Grounded Image Captioning. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 4 (May 2021), 3394-3402.