[1]
Wang, Z., Bao, R., Wu, Q. and Liu, S. 2021. Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 4 (May 2021), 2835-2843.